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Preface 


Except in this paragraph and on the title page, all 
instances of “PowerPC AS” in this document should 
be interpreted as referring to the tags inactive subset 
of the full PowerPC AS Architecture unless otherwise 
stated. 


This document defines the PowerPC AS User Instruc- 
tion Set Architecture. It covers the base instruction 
set and related facilities available to the application 
programmer. 


Other related documents define the PowerPC AS 
Virtual Environment Architecture, the PowerPC AS 
Operating Environment Architecture, and PowerPC AS 
Implementation Features. Book Il, PowerPC AS 
Virtual Environment Architecture defines the storage 
model and related instructions and facilities available 
to the application programmer, and the time-keeping 
facilities available to the application programmer. 
Book Ill, PowerPC AS Operating Environment Archi- 
tecture defines the system (privileged) instructions 
and related facilities. Book IV, PowerPC AS Imple- 
mentation Features defines the implementa- 
tion-dependent aspects of a particular 
implementation. 


As used in this document, the term “PowerPC AS 
Architecture” refers to the instructions and facilities 
described in Books |, Il, and Ill. The description of the 
instantiation of the PowerPC AS Architecture in a 
given implementation includes also the material in 
Book IV for that implementation. 


Engineering Note 


The PowerPC AS Architecture permits implementa- 
tion-specific extensions to the architecture to be 
defined in Book IV. This Note provides guidelines 
and limitations on the features that are permitted 
to be defined in that book. Any exceptions to the 
guidelines and limitations must be approved in 
advance by the PowerPC AS Architecture process. 


To understand the terminology used in this Note it 
may be necessary to refer to Book Il and Book III. 
In particular, the term “privileged state” means a 
processor state in which nearly all resources of the 
architecture are accessible (typically the state in 
which operating systems run) and the term 
“oroblem state” means a processor state in which 
spe ogee resources are not accessible plea 
the state in which application programs run). ( 
few resources are accessible only in "hypervisor 
state”; see the section entitled "Logical Parti- 
tioning (LPAR)” in Book III.) 


m The only architecture resources (e.g., opcodes, 
SPR numbers, interrupt vector locations, bits 
in defined registers and in defined storage 
tables) that may be used for implementa- 
tion-specific differences or extensions are 
those explicitly identified in Book |, Il, or Ill as 
reserved for implementation-specific use. 

m It is imperative that fragmentation of the soft- 
ware base be avoided. Application software 
must be able to run without change on all 
implementations. Operating system software 
that te the programming model described 
in Book [Il must run without change on imple- 
mentations that claim to conform to Book III. 
Any difference or extension that is likely to 
fragment the software base is prohibited. 
Examples include but are not limited to the fol- 


lowing: 

— Features, including instructions and regis- 

ters, that are accessible in problem state. 

— Mechanisms that control whether a 
feature is accessible in problem state. 

— Privileged features, including instructions 
and registers, that provide functions 
useful primarily to application software. 

m [tis permissible to provide a privileged control 
mechanism that can be used to alter the 
behavior of a defined feature for use in per- 
forming infrequent operations associated with 
System initialization and the like. An example 
is a control mechanism that causes a TLB 
invalidation instruction to interpret an operand 
as specifying the physical TLB entry to be 
invalidated, enabling software to invalidate all 
TLB entries during system initialization. 

m Any implementation-specific resource having 
the property that alteration of the resource by 
a processor in one partition could affect the 
integrity of other partitions must be a 
hypervisor resource; see the Book Ill section 
cited above. 
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User Responsibilities 


Do not make any unauthorized alterations to the 
document (user notes are permitted). 


Destroy the entire document when it is super- 
seded, obsolete, or no longer needed. 


Distribute copies of the document or portions of 
the document only to authorized persons. 


Verify the version prior to use. The version ver- 
ification procedure is described later in this 
preface. 


Verify completeness prior to use. The last page 
is labeled "Last Page - End of Document”. The 
end of the Table of Contents shows the last page 
number. 


Version Verification 


See the PowerPC AS representative for your company. 


m Report any deviations from these procedures to 
the document owner. 


Next Scheduled Review 

There is no scheduled review. 

Approval Process 

The process used by the Processor Architecture 
Review Board (PARB) to approve or reject changes 
proposed for this architecture is documented at the 
following DFS directory: 
l...laustin.ibm.com/fs/projects/utds/server arch/process 


Approvals 


This version has been approved by the PARB. 
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1.1 Overview 


This chapter describes computation modes, compat- 
ibility with the POWER Architecture, document con- 
ventions, a processor overview, instruction formats, 
storage addressing, and instruction fetching. 


1.2 Computation Modes 


The PowerPC AS Architecture requires a 64-bit imple- 
mentation, in which all registers except some Special 
Purpose Registers are 64 bits long and effective 
addresses are 64 bits long. All implementations have 
two modes of operation: 64-bit mode and 32-bit mode. 
The mode controls how the effective address is inter- 
preted, how status bits are set, and how the Count 


Register is tested by Branch Conditional instructions. 
All instructions are available in both modes. In both 
64-bit mode and 32-bit mode, instructions that set a 
64-bit register affect all 64 bits, and the value placed 
into the register is independent of mode. In both 
modes, effective address computations use all 64 bits 
of the relevant registers (General Purpose Registers, 
Link Register, Count Register, etc.) and produce a 
64-bit result. However, in 32-bit mode, the high-order 
32 bits of the computed effective address are ignored 
when accessing data and are set to O when fetching 
instructions. 


The PowerPC AS Architecture does not permit an 
implementation that provides only the equivalent of 
32-bit mode (i.e., an implementation in which all regis- 
ters except Floating-Point Registers are 32 bits long). 
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1.3 Instruction Mnemonics and 
Operands 


The description of each instruction includes the mne- 
monic and a formatted list of operands. Some exam- 
ples are the following. 


stw RS,D(RA) 
addis RT,RA,SI 


PowerPC AS-compliant Assemblers will support the 
mnemonics and operand lists exactly as shown. They 
should also provide certain extended mnemonics, as 
described in Appendix B, “Assembler Extended 
Mnemonics” on page 143. 


1.4 Compatibility with the 
POWER Architecture 


The PowerPC AS Architecture provides binary com- 
patibility for POWER application programs, except as 
described in Appendix E, “Incompatibilities with the 
POWER Architecture” on page 163. 


Many of the PowerPC AS instructions are identical to 
POWER instructions. For some of these the PowerPC 
AS instruction name and/or mnemonic differs from 
that in POWER. To assist readers familiar with the 
POWER Architecture, POWER mnemonics are shown 
with the individual instruction descriptions when they 
differ from the PowerPC AS mnemonics. Also, 
Appendix D, “Cross-Reference for Changed POWER 
Mnemonics” on page 161 provides a cross-reference 
from POWER mnemonics to PowerPC AS mnemonics 
for the instructions in Books |, Il, and III. 


References to the POWER Architecture include 
POWER2 implementations of the POWER Architecture 
unless otherwise stated. 


The 
throughout the PowerPC AS Architecture documents. 


1.5 Document Conventions 


1.5.1 Definitions and Notation 


following definitions and notation are used 


A program is a sequence of related instructions. 


Quadwords are 128 bits, doublewords are 64 bits, 
words are 32 bits, halfwords are 16 bits, and 
bytes are 8 bits. 


All numbers are decimal unless specified in some 
special way. 


— Obnnnn means a number expressed in binary 
format. 

— Oxnnnn means a 
hexadecimal format. 


number expressed in 


Underscores may be used between digits. 


RT, RA, R1, ... refer to General Purpose Regis- 
ters. 
FRT, FRA, FR1, ... refer to Floating-Point Regis- 
ters. 


(x) means the contents of register x, where x is 
the name of an instruction field. For example, 
(RA) means the contents of register RA, and 
(FRA) means the contents of register FRA, where 
RA and FRA are instruction fields. Names such 
as LR and CTR denote registers, not fields, so 
parentheses are not used with them.  Paren- 
theses are also omitted when register x is the 
register into which the result of an operation is 
placed. 


(RA|O) means the contents of register RA if the 
RA field has the value 1-31, or the value 0 if the 
RA field is 0. 


Bits in registers, instructions, and fields are spec- 
ified as follows. 


— Bits are numbered left to right, starting with 
bit 0. 

— Ranges of bits are specified by two numbers 
separated by a colon (:). The range p:q con- 
sists of bits p through q. 


Xp means bit p of register/field X. 


Xp: Means bits p through q of register/field X. 


p:q 
Xp q... means bits p, q, ... of register/field X. 


(RA) means the one's complement of the con- 
tents of register RA. 


Field i refers to bits 4xi through 4xi+3 of a reg- 
ister. 


A period (.) as the last character of an instruction 
mnemonic means that the instruction records 
status information in certain fields of the Condi- 
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tion Register as a side effect of execution, as 
described in Chapter 2 through Chapter 4. 


The symbol | is used to describe the concat- 
enation of two values. For example, 010 | 111 is 
the same as 010111. 


x" means x raised to the nt? power. 


"x means the replication of x, n times (i.e., x con- 
catenated to itself n- 1 times). "0 and "1 are 
special cases: 


— "0 means a field of n bits with each bit equal 
to 0. Thus 50 is equivalent to 0b00000. 

— "1 means a field of n bits with each bit equal 
to 1. Thus 51 is equivalent to 0b11111. 


Positive means greater than zero. 
Negative means less than zero. 


A system library program is a component of the 
system software that can be called by an applica- 
tion program using a Branch instruction. 


A system service program is a component of the 
system software that can be called by an applica- 
tion program using a System Call instruction. 


The system trap handler is a component of the 
system software that receives control when the 
conditions specified in a Trap instruction are sat- 
isfied. 


The system error handler is a component of the 
system software that receives control when an 
error occurs. The system error handler includes 
a component for each of the various kinds of 
error. These error-specific components are 
referred to as the system alignment error 
handler, the system data storage error handler, 
etc. 


Each bit and field in instructions, and in status 
and control registers (XER and FPSCR) and 
Special Purpose Registers, is either defined or 
reserved. 


1, Il, 111, ... denotes a reserved field in an instruc- 
tion. 


Latency refers to the interval from the time an 
instruction begins execution until it produces a 
result that is available for use by a subsequent 
instruction. 


Unavailable refers to a resource that cannot be 
used by the program. For example, storage is 
unavailable if access to it is denied. See Book III, 
PowerPC AS Operating Environment Architecture. 


The results of executing a given instruction are 
said to be boundedly undefined if they could have 
been achieved by executing an arbitrary 
sequence of instructions, starting in the state the 
machine was in before executing the given 
instruction. Boundedly undefined results for a 
given instruction may vary between implementa- 


tions, and between different executions on the 
same implementation, and are not further defined 
in this document. 


m The sequential execution model is the model of 
program execution described in Section 2.2, 
"Instruction Fetching" on page 17. 


1.5.2 Reserved Fields 


All reserved fields in instructions should be zero. If 
they are not, the instruction form is invalid; see 
Section 1.9.2, "Invalid Instruction Forms" on page 12. 


The handling of reserved bits in System Registers 
(e.g., XER, FPSCR) is implementation-dependent. 
Unless otherwise stated, software is permitted to 
write any value to such a bit. A subsequent reading 
of the bit returns O if the value last written to the bit 
was 0 and returns an undefined value (0 or 1) other- 
wise. 


Programming Note 


It is the responsibility of software to preserve bits 
that are now reserved in System Registers, as 
they may be assigned a meaning in some future 
version of the architecture. 


In order to accomplish this preservation in imple- 
mentation-independent fashion, software should 
do the following. 


m Initialize each such register supplying zeros 
for all reserved bits. 

m Alter (defined) bit(s) in the register by reading 
the register, altering only the desired bit(s), 
and then writing the new value back to the 
register. 


The XER and FPSCR are partial exceptions to this 
recommendation. Software can alter the status 
bits in these registers, preserving the reserved 
bits, by executing instructions that have the side 
effect of altering the status bits. Similarly, soft- 
ware can alter any defined bit in the FPSCR by 
executing a Floating-Point Status and Control Reg- 
ister instruction. Using such instructions is likely 
to yield better performance than using the method 
described in the second item above. 


When a currently reserved bit is subsequently 
assigned a meaning, every effort will be made to 
have the value to which the system initializes the 
bit correspond to the "old behavior". 


Engineering Note 


Reserved bits in System Registers need not be 
implemented. 
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1.5.3 Description of Instruction 
Operation 


A formal description is given of the operation of each 
instruction. In addition, the operation of most 
instructions is described by a semiformal language at 
the register transfer level (RTL). This RTL uses the 
notation given below, in addition to the definitions and 
notation described in Section 1.5.1, “Definitions and 
Notation” on page 2. Some of this notation is also 
used in the formal descriptions of instructions. RTL 
notation not summarized here should be self- 
explanatory. 


The RTL descriptions cover the normal execution of 
the instruction, except that “standard” setting of the 
Condition Register, Fixed-Point Exception Register, 
and Floating-Point Status and Control Register are not 
shown. (“Non-standard” setting of these registers, 
such as the setting of the Condition Register by the 
Compare instructions, is shown.) The RTL 
descriptions do not cover cases in which the system 
error handler is invoked, or for which the results are 
boundedly undefined. 


The RTL descriptions specify the architectural trans- 
formation performed by the execution of an instruc- 


tion. They do not imply any particular 
implementation. 

Notation Meaning 

¢ Assignment 

€ ¡sa Assignment of an instruction effec- 


tive address. In 32-bit mode the 
high-order 32 bits of the 64-bit target 
address are set to 0. 
4 NOT logical operator 
Two's complement addition 
Two's complement subtraction, unary 
minus 
Multiplication 
Division (yielding quotient) 
Square root 
Equals, Not Equals relations 
,2 Signed comparison relations 
Unsigned comparison relations 
Unordered comparison relation 
AND, OR logical operators 
; Exclusive OR, Equivalence 
operators ((a=b) = (a®-7b)) 
Absolute value of x 
Least integer > x 
Result of converting x from floating- 
point single format to floating-point 
double format, using the model 
shown on page 98 
Result of extending x on the left with 
sign bits 
Greatest integer < x 
General Purpose Register x 


+ 


x 


Q go 3 AA I med 


logical 
ABS(x) 


CEIL(x) 
DOUBLE(x) 


EXTS(x) 


FLOOR(x) 
GPR(x) 


MASK(x, y) 


MEM(x, y) 


ROTLg4(X, y) 


ROTLgo(x, y) 


SINGLE(x) 


SPREG(x) 
TRAP 
characterization 


undefined 


CIA 


NIA 


if ... then ... else ... 


do 


Mask having 1s in positions x 
through y (wrapping if x > y) and Os 
elsewhere 

Contents of y bytes of storage 
starting at address x. In 32-bit mode 
the high-order 32 bits of the 64-bit 
value x are ignored. 

Result of rotating the 64-bit value x 
left y positions 

Result of rotating the 64-bit value x |x 
left y positions, where x is 32 bits 
long 

Result of converting x from floating- 
point double format to floating-point 
single format, using the model shown 
on page 101 

Special Purpose Register x 

Invoke the system trap handler 
Reference to the setting of status 
bits, in a standard way that is 
explained in the text 

An undefined value. The value may 
vary between implementations, and 
between different executions on the 
same implementation. 

Current Instruction Address, which is 
the 64-bit address of the instruction 
being described by a sequence of 
RTL. Used by relative branches to 
set the Next Instruction Address 
(NIA), and by Branch instructions 
with LK=1 to set the Link Register. 
In 32-bit mode the high-order 32 bits 
of CIA are always set to 0. Does not 
correspond to any architected reg- 
ister. 

Next Instruction Address, which is 
the 64-bit address of the next 
instruction to be executed. For a 
successful branch, the next instruc- 
tion address is the branch target 
address: in RTL, this is indicated by 
assigning a value to NIA. For other 
instructions that cause non- 
sequential instruction fetching (see 
Book Ill, PowerPC AS Operating 
Environment Architecture), the RTL 
is similar. For instructions that do 
not branch, and do not otherwise 
cause instruction fetching to be non- 
sequential, the next instruction 
address is CIA+4. In 32-bit mode 
the high-order 32 bits of NIA are 
always set to 0. Does not corre- 
spond to any architected register. 
Conditional execution, indenting 
shows range; else is optional 

Do loop, indenting shows range. 
“To” and/or “by” clauses specify 
incrementing an iteration variable, 
and a “while” clause gives termi- 
nation conditions. 
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leave Leave innermost do loop, or do loop 
described in leave statement 
for For loop, indenting shows range. 


Clause after “for” specifies the enti- 
ties for which to execute the body of 
the loop. 


The precedence rules for RTL operators are summa- 
rized in Table 1. Operators higher in the table are 
applied before those lower in the table. Operators at 
the same level in the table associate from left to 
right, from right to left, or not at all, as shown. (For 
example, — associates from left to right, so a- b- c = 
(a-b)- c.) Parentheses are used to override the eval- 
uation order implied by the table or to increase 
clarity; parenthesized expressions are evaluated 
before serving as operands. 


Table 1. Operator precedence 


Operators 


Associativity 
left to right 
right to left 


subscript, function evaluation 


pre-superscript (replication), 
post-superscript (exponentiation) 


unary —,7 right to left 


left to right 
left to right 
left to right 


left to right 


left to right 
left to right 


1.6 Processor Overview 


The processor implements the instruction set, the 
storage model, and other facilities defined in this doc- 
ument. Instructions that the processor can execute 
fall into three classes: 


m branch instructions 
m fixed-point instructions 
m floating-point instructions 


Branch instructions are described in Section 2.4, 
“Branch Processor Instructions” on page 20. Fixed- 
point instructions are described in Section 3.3, “Fixed- 
Point Processor Instructions” on page 31. 


Floating-point instructions are described in Section 
4.6, “Floating-Point Processor Instructions” on 
page 97. 


Fixed-point instructions operate on byte, halfword, 
word, and doubleword operands. Floating-point 
instructions operate on single-precision and double- 
precision floating-point operands. The PowerPC AS 
Architecture uses instructions that are four bytes long 
and word-aligned. It provides for byte, halfword, 
word, and doubleword operand fetches and stores 
between storage and a set of 32 General Purpose 
Registers (GPRs). It also provides for word and 
doubleword operand fetches and stores between 
storage and a set of 32 Floating-Point Registers 
(FPRs). 


Signed integers are represented in two's complement 
form. 


There are no computational instructions that modify 
storage. To use a storage operand in a computation 
and then modify the same or another storage 
location, the contents of the storage operand must be 
loaded into a register, modified, and then stored back 
to the target location. Figure 1 is a logical represen- 
tation of instruction processing. Figure 2 on page 6 
shows the registers of the PowerPC AS User Instruc- 
tion Set Architecture. 


Branch 
m) Processing 


Fixed-Point and 
Floating-Point 


Instructions 
Fixed-Pt Float-Pt 
Processing Processing 


Data to/from 
Storage 


| 


Storage 


Instructions 
from Storage 


Figure 1. Logical processing model 
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0 31 
| LR | Link Register (page 19) 
0 63 
CTR Count Register (page 19) 
0 63 


Figure 2. PowerPC AS user register set 


1.7 Instruction Formats 


All instructions are four bytes long and word-aligned. 
Thus, whenever instruction addresses are presented 
to the processor (as in Branch instructions) the low- 
order two bits are ignored. Similarly, whenever the 
processor develops an instruction address the low- 
order two bits are zero. 


Bits 0:5 always specify the opcode (OPCD, below). 
Many instructions also have an extended opcode (XO, 
below). The remaining bits of the instruction contain 
one or more fields as shown below for the different 
instruction formats. 


The format diagrams given below show horizontally 
all valid combinations of instruction fields. The dia- 
grams include instruction fields that are used only by 
instructions defined in Book Il, PowerPC AS Virtual 
Environment Architecture, or in Book Ill, PowerPC AS 
Operating Environment Architecture. 


In some cases an instruction field is reserved, or 
must contain a particular value. If a reserved field 


Condition Register (page 18) 


General Purpose Registers (page 29) 


Fixed-Point Exception Register (page 30) 


Floating-Point Registers (page 82) 


Floating-Point Status and Control Register (page 83) 


does not have all bits set to O, or if a field that must 
contain a particular value does not contain that value, 
the instruction form is invalid and the results are as 
described in Section 1.9.2, “Invalid Instruction Forms” 
on page 12. 


Split Field Notation 


In some cases an instruction field occupies more than 
one contiguous sequence of bits, or occupies one con- 
tiguous sequence of bits that are used in permuted 
order. Such a field is called a split field. In the 
format diagrams given below and in the individual 
instruction layouts, the name of a split field is shown 
in small letters, once for each of the contiguous 
sequences. In the RTL description of an instruction 
having a split field, and in certain other places where 
individual bits of a split field are identified, the name 
of the field in small letters represents the concat- 
enation of the sequences from left to right. In all 
other places, the name of the field is capitalized and 
represents the concatenation of the sequences in 
some order, which need not be left to right, as 
described for each affected instruction. 
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1.7.1 l-Form 


0 


Figure 3. | instruction format 


1.7.2 B-Form 


Figure 4. B instruction format 


1.7.3 SC-Form 


0 6 11 16 20 27 30 81 
OPCD Hl Il Il LEV | // | 1| / 


Figure 5. SC instruction format 


1.7.4 D-Form 


Figure 6. D instruction format 


1.7.5 DS-Form 


Figure 7. DS instruction format 
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1.7.6 X-Form 


11 16 21 31 
RA 


XO 
X 


Xx|x|x|x|x|x| x 
O 


x 


x< | x x| x] x] x 


El 


= 
= 
= 
x 


I 
= 
— 


ae 
al 
Es 

Bu 
EEE 


x 


x 
PS) Ed sss Ped cs Ps Ke Ge sd os (ls ls En 


Il 
I 
x 


paj 
x 


/ 


Figure 8. X instruction format 
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1.7.7 XL-Form 


Figure 9. XL instruction format 


1.7.8 XFX-Form 


7] 
A E 
oo [ar d» — 9 Y 


XO / 
Foo | ns p o» || e ^ 


Figure 10. XFX instruction format 


1.7.9 XFL-Form 


0 67 15 16 21 31 


Figure 11. XFL instruction format 


1.7.10 XS-Form 


0 6 11 16 21 30 31 
| OPCD | RS | RA | sh | xo fis 


Figure 12. XS instruction format 


1.7.11 XO-Form 


Figure 13. XO instruction format 
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1.7.12 A-Form 


Figure 14. A instruction format 


1.7.13 M-Form 


Figure 15. M instruction format 


1.7.14. MD-Form 


0 6 11 16 21 27 3081 
OPCD RS XOjshRa 


Fors se | w- | s ne rob 


Figure 16. MD instruction format 


1.7.15. MDS-Form 


27 
[oco | ss | RA] e | me rope 
Porco | ns | m [ as | me pope 


Figure 17. MDS instruction format 


1.7.16 Instruction Fields 


AA (30) 


Absolute Address bit. 


0 The immediate field represents an address 
relative to the current instruction address. 
For I-form branches the effective address of 
the branch target is the sum of the LI field 
sign-extended to 64 bits and the address of 
the branch instruction. For B-form branches 
the effective address of the branch target is 
the sum of the BD field sign-extended to 64 
bits and the address of the branch instruc- 
tion. 


1 The immediate field represents an absolute 
address. For l-form branches the effective 
address of the branch target is the LI field 
sign-extended to 64 bits. For B-form 
branches the effective address of the branch 
target is the BD field sign-extended to 64 
bits. 


BA (11:15) 
Field used to specify a bit in the CR to be used as 
a Source. 


BB (16:20) 
Field used to specify a bit in the CR to be used as 
a source. 


BD (16:29) 
Immediate field used to specify a 14-bit signed 
two's complement branch displacement which is 
concatenated on the right with 0b00 and sign- 
extended to 64 bits. 


BF (6:8) 
Field used to specify one of the CR fields or one 
of the FPSCR fields to be used as a target. 


BFA (11:13) 
Field used to specify one of the CR fields or one 
of the FPSCR fields to be used as a source. 


BH (19:20) 
Field used to specify a hint in the Branch Condi- 
tional to Link Register and Branch Conditional to 
Count Register instructions. The encoding is 
described in Section 2.4.1, “Branch Instructions” 
on page 20. 


BI (11:15) 
Field used to specify a bit in the CR to be tested 
by a Branch Conditional instruction. 


BO (6:10) 
Field used to specify options for the Branch Con- 
ditional instructions. The encoding is described in 
Section 2.4.1, “Branch Instructions” on page 20. 


BT (6:10) 
Field used to specify a bit in the CR or in the 
FPSCR to be used as a target. 
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D (16:31) 
Immediate field used to specify a 16-bit signed 
two's complement integer which is sign-extended 
to 64 bits. 


DS (16:29) 
Immediate field used to specify a 14-bit signed 
two's complement integer which is concatenated 
on the right with 0b00 and sign-extended to 64 
bits. 


FLM (7:14) 
Field mask used to identify the FPSCR fields that 
are to be updated by the mtfsf instruction. 


FRA (11:15) 
Field used to specify an FPR to be used as a 
source. 


FRB (16:20) 
Field used to specify an FPR to be used as a 
source. 


FRC (21:25) 
Field used to specify an FPR to be used as a 
source. 


FRS (6:10) 
Field used to specify an FPR to be used as a 
source. 


FRT (6:10) 
Field used to specify an FPR to be used as a 
target. 


FXM (12:19) 
Field mask used to identify the CR fields that are 
to be updated by the mterf instruction or moved 
by the optional version of the mfcr instruction. 


L (10) 
Field used to specify whether a fixed-point 
Compare instruction is to compare  64-bit 
numbers or 32-bit numbers. 


Field used by the Synchronize instruction (see 
Book Il, PowerPC AS Virtual Environment Archi- 
tecture). 


Field used by the TLB Invalidate Entry instruction 
(see Book Ill, PowerPC AS Operating Environ- 
ment Architecture). 


LEV (20:26) 
Field used by the System Call instruction. 


LI (6:29) 
Immediate field used to specify a 24-bit signed 
two's complement integer which is concatenated 
on the right with 0b00 and sign-extended to 64 
bits. 


LK (31) 
LINK bit. 


O Do not set the Link Register. 


1 Set the Link Register. The address of the 
instruction following the Branch instruction is 
placed into the Link Register. 


MB (21:25) and ME (26:30) 
Fields used in M-form instructions to specify a 
64-bit mask consisting of 1-bits from bit MB+32 
through bit ME+32 inclusive and O-bits else- 
where, as described in Section 3.3.12, “Fixed- 
Point Rotate and Shift Instructions” on page 68. 


MB (21:26) 
Field used in MD-form and MDS-form instructions 
to specify the first 1-bit of a 64-bit mask, as 
described in Section 3.3.12, “Fixed-Point Rotate 
and Shift Instructions” on page 68. 

ME (21:26) 
Field used in MD-form and MDS-form instructions 
to specify the last 1-bit of a 64-bit mask, as 
described in Section 3.3.12, “Fixed-Point Rotate 
and Shift Instructions” on page 68. 

NB (16:20) 
Field used to specify the number of bytes to 
move in an immediate Move Assist instruction. 

OPCD (0:5) 
Primary opcode field. 

OE (21) 
Field used by XO-form instructions to enable 
setting OV and SO in the XER. 

RA (11:15) 
Field used to specify a GPR to be used as a 
source or as a target. 

RB (16:20) 
Field used to specify a GPR to be used as a 
source. 

Re (31) 
RECORD bit. 


O Do not alter the Condition Register. 
1 Set Condition Register Field O or Field 1 as 


described in Section 2.3.1, “Condition 
Register” on page 18. 
RS (6:10) 
Field used to specify a GPR to be used as a 
source. 
RT (6:10) 


Field used to specify a GPR to be used as a 
target. 


SH (16:20, or 16:20 and 30) 
Field used to specify a shift amount. 


SI (16:31) 
Immediate field used to specify a 16-bit signed 
integer. 


SPR (11:20) 
Field used to specify a Special Purpose Register 
for the mtspr and mfspr instructions. 


SR (12:15) 
Field used by the Segment Register Manipulation 
instructions (see Book Ill, PowerPC AS Operating 
Environment Architecture). 


10 PowerPC AS User Instruction Set Architecture, Tags Inactive Subset 


IBM Confidential - Feb. 24, 1999 


TBR (11:20) 
Field used by the Move From Time Base instruc- 
tion (see Book Il, PowerPC AS Virtual Environ- 
ment Architecture). 


TH (9:10) 
Field used by the optional data stream variant of 
the dcbt instruction (see Book Il, PowerPC AS 
Virtual Environment Architecture). 


TO (6:10) 
Field used to specify the conditions on which to 
trap. The encoding is described in Section 3.3.10, 
“Fixed-Point Trap Instructions” on page 60. 


U (16:19) 
Immediate field used as the data to be placed 
into a field in the FPSCR. 


UI (16:31) 
Immediate field used to specify a 16-bit unsigned 
integer. 


XO (21:29, 21:30, 22:30, 26:30, 27:29, 27:30, or 30:31) 
Extended opcode field. 


1.8 Classes of Instructions 


An instruction falls into exactly one of the following 
three classes: 


Defined 
Illegal 
Reserved 


The class is determined by examining the opcode, and 
the extended opcode if any. If the opcode, or combi- 
nation of opcode and extended opcode, is not that of 
a defined instruction or of a reserved instruction, the 
instruction is illegal. 


A given instruction is in the same class for all imple- 
mentations of the PowerPC AS Architecture. In future 
versions of this architecture, instructions that are now 
illegal may become defined (by being added to the 
architecture) or reserved (by being assigned to one of 
the special purposes described in Appendix H, 
“Reserved Instructions” on page 177). Similarly, 
instructions that are now reserved may become 
defined. 


1.8.1 Defined Instruction Class 


This class of instructions contains all the instructions 
defined in the PowerPC AS User Instruction Set Archi- 
tecture, PowerPC AS Virtual Environment Architec- 
ture, and PowerPC AS Operating Environment 
Architecture. 


In general, defined instructions are guaranteed to be 


provided in all implementations. The only exceptions 
are instructions that are optional instructions. These 
exceptions are identified in the instruction 
descriptions. 


A defined instruction can have preferred and/or 
invalid forms, as described in Section 1.9.1, “Pre- 
ferred Instruction Forms” on page 12 and Section 
1.9.2, “Invalid Instruction Forms” on page 12. 


1.8.2 Illegal Instruction Class 


This class of instructions contains the set of 
instructions described in Appendix G, ‘Illegal 
Instructions” on page 175. Illegal instructions are 
available for future extensions of the PowerPC AS 
Architecture; that is, some future version of the 
PowerPC AS Architecture may define any of these 
instructions to perform new functions. 


Any attempt to execute an illegal instruction will 
cause the system illegal instruction error handler to 
be invoked and will have no other effect. 


An instruction consisting entirely of binary Os is guar- 
anteed always to be an illegal instruction. This 
increases the probability that an attempt to execute 
data or uninitialized storage will result in the invoca- 
tion of the system illegal instruction error handler. 


Architecture Note 


Instructions in this class were formerly called 
“invalid instructions”. The term was changed to 
“illegal instructions” to reduce confusion between 
these instructions and invalid forms of defined 
instructions. 


1.8.3 Reserved Instruction Class 


This class of instructions contains the set of 
instructions described in Appendix H, “Reserved 
Instructions” on page 177. 


Reserved instructions are allocated to specific pur- 
poses that are outside the scope of the PowerPC AS 
Architecture. 


Any attempt to execute a reserved instruction will: 


m perform the actions described in Book IV, 
PowerPC AS Implementation Features for the 
implementation if the instruction is implemented; 
or 

m cause the system illegal instruction error handler 
to be invoked if the instruction is not imple- 
mented. 
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1.9 Forms of Defined 
Instructions 


1.9.1 Preferred Instruction Forms 


Some of the defined instructions have preferred 
forms. For such an instruction, the preferred form will 
execute in an efficient manner, but any other form 
may take significantly longer to execute than the pre- 
ferred form. 


Instructions having preferred forms are: 


the Condition Register Logical instructions 

the Load/Store Multiple instructions 

the Load/Store String instructions 

the Or Immediate instruction (preferred form of 
no-op) 

= the Move To Condition Register Fields instruction 


1.9.2 Invalid Instruction Forms 


Some of the defined instructions have invalid forms. 
An instruction form is invalid if one or more fields of 
the instruction, excluding the opcode field(s), are 
coded incorrectly in a manner that can be deduced by 
examining only the instruction encoding. 


Any attempt to execute an invalid form of an instruc- 
tion will either cause the system illegal instruction 
error handler to be invoked or yield boundedly unde- 
fined results. Exceptions to this rule are stated in the 
instruction descriptions. 


Some kinds of invalid form can be deduced from the 
instruction layout. These are listed below. 


m Field shown as "/"(s) but coded as nonzero. 


m Field shown as containing a particular value but 
coded as some other value. 


These invalid forms are not discussed further. 


Instructions having invalid forms that cannot be so 
deduced are listed below. These kinds of invalid form 
are identified in the instruction descriptions. 


the Branch Conditional instructions 

the Load/Store with Update instructions 
the Load Multiple instruction 

the Load String instructions 

the Load/Store Floating-Point with 
instructions 


Update 


Assembler Note 


To the extent possible, the Assembler should 
report uses of invalid instruction forms as errors. 


Engineering Note 


Causing the system illegal instruction error 
handler to be invoked if attempt is made to 
execute an invalid form of an instruction facili- 
tates the debugging of software. 
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1.10 Optionality 


Some of the defined instructions are optional. The 
optional instructions are defined in Chapter 5, 
“Optional Facilities and Instructions” on page 119. 
Additional optional instructions may be defined in 
Books Il and Ill (e.g., see the section entitled “Look- 
aside Buffer Management” in Book Ill, and the chap- 
ters entitled “Optional Facilities and Instructions” in 
Book Il and Book III). 


I Architecture Note 


In general, optional facilities and instructions are 
described in chapters, appendices, and sections for 
which the title contains the word “Optional”. 


A facility or instruction is optional for any one of the 
following reasons. 


1. It is being phased into the architecture. At 
some future date it will be required and no 
longer optional. 


2. It is being phased out of the architecture. 
System developers should develop a migration 
plan to eliminate use of it in new systems. 


3. It is useful primarily for certain kinds of applica- 
tions and systems. It is likely to remain in the 
architecture, as optional. 


Categories 1 and 2 permit the architecture to evolve 
gradually, by providing an intermediate status for 
facilities and instructions that are being added to or 
removed from the architecture. Category 3 is 
intended for facilities and instructions that are typi- 
cally used primarily in library routines. 


The category that a given optional facility or instruc- 
tion is in can be identified as follows. The 
prototypical Notes and text shown below are altered 
as needed for each specific case. 


Category 1 


The description of each facility or instruction in this 
category contains an Engineering Note, the wording 
of which depends on how new the facility or instruc- 
tion is. When the facility or instruction is first added 
to the architecture, the wording is similar to the fol- 
lowing. 

Engineering Note: 

This instruction is being phased into the archi- 


tecture, and will become required in a future 
version of the architecture. 


Subsequently, when a version number “n.mm” of the 
architecture has been determined such that 
processors being designed to comply with other 
aspects of that version will implement the facility or 
instruction, the wording is changed to be similar to 
the following. 


Any attempt to execute an optional instruction that is 
not provided by the implementation will cause the 
system illegal instruction error handler to be invoked. 


In addition to instructions, other kinds of optional 
facilities, such as registers, may be defined in Books 
Il and Ill. The effects of attempting to use an optional 
facility that is not provided by the implementation are 
described in Books II and Ill as appropriate. 


Engineering Note: 

This instruction is being phased into the archi- 
tecture, and must be implemented in processors 
that comply with Version n.mm of the architec- 
ture specification or with any subsequent 
version. 


When the facility or instruction later becomes 
required, its description will be moved to the body of 
the Book if necessary, and the Engineering Note will 
be removed. 


Category 2 


The facilities and instructions in this category gener- 
ally appear in a separate chapter. A prominent 
warning such as the following appears in the chapter 
introduction. 


Warning: The facilities and instructions 
described in this chapter are being phased out 
of the architecture. 


Also, the description of each such facility or instruc- 
tion contains a Programming Note and an Engi- 
neering Note similar to the following. 


Programming Note: 

Warning: This instruction is being phased out of 
the architecture. It is likely to perform poorly on 
future implementations. New programs should 
not use it. 


Engineering Note: 
Decisions regarding whether to implement this 
instruction in a given implementation, and how 
well to make it perform there, must include con- 
sideration of migration plans for existing soft- 
ware that uses it. 


Category 3 


The facilities and instructions in this category are 
identified by the absence of the distinguishing marks 
of the other two categories. 


A | 
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1.11 Exceptions 


There are two kinds of exception, those caused 
directly by the execution of an instruction and those 
caused by an asynchronous event. In either case, the 
exception may cause one of several components of 
the system software to be invoked. 


The exceptions that can be caused directly by the 
execution of an instruction include the following: 


m an attempt to execute an illegal instruction, or an 
attempt by an application program to execute a 
“privileged” instruction (see Book III, PowerPC 
AS Operating Environment Architecture) (system 
illegal instruction error handler or system privi- 
leged instruction error handler) 


m the execution of a defined instruction using an 
invalid form (system illegal instruction error 
handler or system privileged instruction error 
handler) 


m the execution of an optional instruction that is not 
provided by the implementation (system illegal 
instruction error handler) 


m an attempt to access a storage location that is 
unavailable (system instruction storage error 
handler or system data storage error handler) 


m an attempt to access storage with an effective 
address alignment that is invalid for the instruc- 
tion (system alignment error handler) 


m the execution of a System Call instruction 
(system service program) 


m the execution of a Trap instruction that traps 
(system trap handler) 


m the execution of a floating-point instruction that 
causes a floating-point enabled exception to exist 
(system floating-point enabled exception error 
handler) 


The exceptions that can be caused by an asynchro- 
nous event are described in Book Ill, PowerPC AS 
Operating Environment Architecture. 


The invocation of the system error handler is precise, 
except that if one of the imprecise modes for invoking 
the system floating-point enabled exception error 
handler is in effect (see page 90) then the invocation 
of the system floating-point enabled exception error 
handler may be imprecise. When the system error 
handler is invoked imprecisely, the excepting instruc- 
tion does not appear to complete before the next 
instruction starts (because one of the effects of the 
excepting instruction, namely the invocation of the 
system error handler, has not yet occurred). 


Additional information about exception handling can 
be found in Book III, PowerPC AS Operating Environ- 
ment Architecture. 


1.12 Storage Addressing 


A program references storage using the effective 
address computed by the processor when it executes 
a Storage Access or Branch instruction (or certain 
other instructions described in Book Il, PowerPC AS 
Virtual Environment Architecture, and Book Ill, 
PowerPC AS Operating Environment Architecture), or 
when it fetches the next sequential instruction. 


1.12.1 Storage Operands 


Bytes in storage are numbered consecutively starting 
with 0. Each number is the address of the corre- 
sponding byte. 


Storage operands may be bytes, halfwords, words, or 
doublewords, or, for the Load/Store Multiple and 
Move Assist instructions, a sequence of bytes or 
words. The address of a storage operand is the 
address of its first byte (i.e., of its lowest-numbered 
byte). Byte ordering is Big-Endian. However, if the 
optional Little-Endian facility is implemented the 
system can be operated in a mode in which byte 
ordering is Little-Endian; see Section 5.3. 


Operand length is implicit for each instruction. 


The operand of a single-register Storage Access 
instruction has a "natural" alignment boundary equal 
to the operand length. In other words, the "natural" 
address of an operand is an integral multiple of the 
operand length. A storage operand is said to be 
aligned if it is aligned at its natural boundary; other- 
wise it is said to be unaligned. 


Storage operands for single-register Storage Access 
instructions have the following characteristics. 
(Although not permitted as storage operands, 
quadwords are shown because quadword alignment is 
desirable for certain storage operands.) 


Operand Length Addrso:s3 if aligned 
Byte 8 bits XXXX 

Halfword 2 bytes xxx0 

Word 4 bytes xx00 

Doubleword 8 bytes x000 

Quadword 16 bytes 0000 

Note: An “x” in an address bit position indicates 
that the bit can be 0 or 1 independent of the state of 
other bits in the address. 


The concept of alignment is also applied more gener- 
ally, to any datum in storage. For example, a 12-byte 
datum in storage is said to be word-aligned if its 
address is an integral multiple of 4. 
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Some instructions require their storage operands to 
have certain alignments. In addition, alignment may 
affect performance. For single-register Storage 
Access instructions the best performance is obtained 
when storage operands are aligned. Additional 
effects of data placement on performance are 
described in Book Il, PowerPC AS Virtual Environment 
Architecture. 


Instructions are always four bytes long and word- 
aligned. 


1.12.2 Effective Address Calculation 


The address computed by the processor when exe- 
cuting a Storage Access or Branch instruction (or 
certain other instructions described in Book Il, 
PowerPC AS Virtual Environment Architecture, and 
Book Ill, PowerPC AS Operating Environment Archi- 
tecture), or when fetching the next sequential instruc- 
tion, is called the effective address and specifies a 
byte in storage. 


In general, effective address computations, for both 
data and instruction accesses, use 64-bit effective 
address addition. Thus all 64 bits participate, regard- 
less of mode (32-bit or 64-bit). The 64-bit current 
instruction address and next instruction address are 
not affected by a change from 32-bit mode to 64-bit 
mode, but they are affected by a change from 64-bit 
mode to 32-bit mode (the high-order 32 bits are set to 
0). 


In 64-bit mode, the entire 64-bit result comprises the 
64-bit effective address. The effective address arith- 
metic wraps around from the maximum address, 
284 1, to address 0. 


In 32-bit mode, the low-order 32 bits of the 64-bit 
result comprise the effective address for the purpose 
of addressing storage. The high-order 32 bits of the 
64-bit effective address are ignored for the purpose of 
accessing data, but are included whenever a 64-bit 
effective address is placed into a GPR by Load with 
Update and Store with Update instructions. The high- 
order 32 bits of the 64-bit effective address are set to 
O for the purpose of fetching instructions, and when- 
ever a 64-bit effective address is placed into the Link 
Register by Branch instructions having LK=1. The 
high-order 32 bits of the 64-bit effective address are 
set to O in Special Purpose Registers when the 
system error handler is invoked. As used to address 
storage, the effective address arithmetic appears to 
wrap around from the maximum address, 282-1, to 
address 0. 


A zero in the RA field indicates the absence of the 
corresponding address component. For the absent 
component, a value of zero is used for the address. 
This is shown in the instruction descriptions as (RAJO). 


Effective addresses are computed as follows. In the 
descriptions below, it should be understood that "the 
contents of a GPR” refers to the entire 64-bit con- 
tents, independent of mode, but that in 32-bit mode 
only bits 32:63 of the 64-bit result of the computation 
are used to address storage. 


m With X-form instructions, in computing the effec- 
tive address of a data element, the contents of 
the GPR designated by RB (or the value zero for 
Iswi and stswi) are added to the contents of the 
GPR designated by RA or to zero if RA=0. 


m With D-form instructions, the 16-bit D field is sign- 
extended to form a 64-bit address component. In 
computing the effective address of a data 
element, this address component is added to the 
contents of the GPR designated by RA or to zero 
if RA=0. 


m With DS-form instructions, the 14-bit DS field is 
concatenated on the right with 0b00 and sign- 
extended to form a 64-bit address component. In 
computing the effective address of a data 
element, this address component is added to the 
contents of the GPR designated by RA or to zero 
if RA=0. 


m With I-form Branch instructions, the 24-bit LI field 
is concatenated on the right with 0b00 and sign- 
extended to form a 64-bit address component. If 
AA=0, this address component is added to the 
address of the Branch instruction to form the 
effective address of the next instruction. If 
AA=1, this address component is the effective 
address of the next instruction. 


m With B-form Branch instructions, the 14-bit BD 
field is concatenated on the right with 0b00 and 
sign-extended to form a 64-bit address compo- 
nent. If AA=0, this address component is added 
to the address of the Branch instruction to form 
the effective address of the next instruction. If 
AA=1, this address component is the effective 
address of the next instruction. 


m With XL-form Branch instructions, bits 0:61 of the 
Link Register or the Count Register are concat- 
enated on the right with 0b00 to form the effec- 
tive address of the next instruction. 


m With sequential instruction fetching, the value 4 is 
added to the address of the current instruction to 
form the effective address of the next instruction. 
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Chapter 2. Branch Processor 


2.1 Branch Processor Overview .... 17 
2.2 Instruction Fetching ......... 17 
2.3 Branch Processor Registers .... 18 
2.3.1 Condition Register ......... 18 
2.3.2 Link Register ............ 19 
2.3.3 Count Register ........... 19 
2.4 Branch Processor Instructions ... 20 


2.1 Branch Processor Overview 


This chapter describes the registers and instructions 
that make up the Branch Processor facility. Section 
2.3, “Branch Processor Registers” on page 18 
describes the registers associated with the Branch 
Processor. Section 2.4, “Branch Processor 
Instructions” on page 20 describes the instructions 
associated with the Branch Processor. 


2.2 Instruction Fetching 


In general, instructions appear to execute sequen- 
tially, in the order in which they appear in storage. 
The exceptions to this rule are listed below. 


m Branch instructions for which the branch is taken 
cause execution to continue at the target address 
specified by the Branch instruction. 


m Trap instructions for which the trap conditions are 
satisfied, and System Call instructions, cause the 
appropriate system handler to be invoked. 


m Exceptions can cause the system error handler to 
be invoked, as described in Section 1.11, 
"Exceptions" on page 14. 


m Returning from a system service program, 
system trap handler, or system error handler 
causes execution to continue at a specified 
address. 


2.4.1 Branch Instructions ........ 20 
2.4.2 System Call Instruction ...... 25 
2.4.3 Condition Register Logical 
Instructions ................ 26 
2.4.4 Condition Register Field 
Instruction ................. 28 


The model of program execution in which each 
instruction appears to complete before the next 
instruction starts is called the “sequential execution 
model”. In general, from the view of the processor 
executing the instructions, the sequential execution 
model is obeyed. For the instructions and facilities 
defined in this Book, the only exceptions to this rule 
are the following. 


m A floating-point exception occurs when the 
processor is running in one of the Imprecise float- 
ing-point exception modes (see Section 4.4, 
“Floating-Point Exceptions” on page 89). The 
instruction that causes the exception does not 
complete before the next instruction starts, with 
respect to setting exception bits and (if the excep- 
tion is enabled) invoking the system error 
handler. 


m A Store instruction modifies a storage location 
that contains an instruction. Software synchroni- 
zation is required to ensure that subsequent 
instruction fetches from that location obtain the 
modified version of the instruction; see Book Il, 
PowerPC AS Virtual Environment Architecture. 


Programming Note 


If a program modifies the instructions it 
intends to execute, it should call the appro- 
priate system library program before 
attempting to execute the modified 


instructions, to ensure that the modifications 
have taken effect with respect to instruction 
fetching. 
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2.3 Branch Processor Registers 


2.3.1 Condition Register 


The Condition Register (CR) is a 32-bit register which 
reflects the result of certain operations, and provides 
a mechanism for testing (and branching). 


0 31 


Figure 18. Condition Register 


The bits in the Condition Register are grouped into 
eight 4-bit fields, named CR Field 0 (CRO), ..., CR Field 
7 (CR7), which are set in one of the following ways. 


m Specified fields of the CR can be set by a move 
to the CR from a GPR (mterf). 

m A specified field of the CR can be set by a move 
to the CR from another CR field (merf), from 
XERa2.35 (merxr), or from the FPSCR (mcrfs). 

m CR Field O can be set as the implicit result of a 
fixed-point instruction. 

m CR Field 1 can be set as the implicit result of a 
floating-point instruction. 

m A specified CR field can be set as the result of 
either a fixed-point or a floating-point Compare 
instruction. 


Instructions are provided to perform logical oper- 
ations on individual CR bits and to test individual CR 
bits. 


For all fixed-point instructions in which Rc=1, and for 
addic., andi., and andis., the first three bits of CR 
Field O (bits 0:2 of the Condition Register) are set by 
signed comparison of the result to zero, and the 
fourth bit of CR Field O (bit 3 of the Condition Reg- 
ister) is copied from the SO field of the XER. "Result" 
here refers to the entire 64-bit value placed into the 
target register in 64-bit mode, and to bits 32:63 of the 
64-bit value placed into the target register in 32-bit 
mode. 


if (64-bit mode) 

then M € 0 

else M € 32 
if (target register)ygg < 0 then c € 0b100 
else if (target register)y.gg > 0 then c € 0b010 
else c « 0b001 
CRO € c | XERso 


If any portion of the result is undefined, then the 
value placed into the first three bits of CR Field O is 
undefined. 

The bits of CR Field 0 are interpreted as follows. 

Bit Description 


O Negative (LT) 
The result is negative. 


1 Positive (GT) 
The result is positive. 


2 Zero (EQ) 
The result is zero. 


3 Summary Overflow (SO) 
This is a copy of the final state of XERgo at the 
completion of the instruction. 


Programming Note 


CR Field 0 may not reflect the “true” (infinitely 
precise) result if overflow occurs; see Section 
3.3.8, “Fixed-Point Arithmetic Instructions” on 
page 49. 


The stwcx. and stdcx. instructions (see Book Il, 
PowerPC AS Virtual Environment Architecture) also 
set CR Field 0. 


For all floating-point instructions in which Rc=1, CR 
Field 1 (bits 4:7 of the Condition Register) is set to the 
Floating-Point exception status, copied from bits 0:3 of 
the Floating-Point Status and Control Register. These 
bits are interpreted as follows. 


Bit Description 


4 Floating-Point Exception Summary (FX) 
This is a copy of the final state of FPSCRry at the 
completion of the instruction. 


5 Floating-Point Enabled Exception Summary (FEX) 
This is a copy of the final state of FPSCRrry at 
the completion of the instruction. 


6 Floating-Point 
Summary (VX) 
This is a copy of the final state of FPSCRyx at the 
completion of the instruction. 


Invalid Operation Exception 


7 Floating-Point Overflow Exception (OX) 
This is a copy of the final state of FPSCRox at 
the completion of the instruction. 


For Compare instructions, a specified CR field is set 
to reflect the result of the comparison. The bits of the 
specified CR field are interpreted as follows. A com- 
plete description of how the bits are set is given in 
the instruction descriptions in Section 3.3.9, “Fixed- 
Point Compare Instructions” on page 58 and Section 
4.6.7, “Floating-Point Compare Instructions” on 
page 114. 


Bit Description 


O Less Than, Floating-Point Less Than (LT, FL) 
For fixed-point Compare instructions, (RA) < Sl 
or (RB) (signed comparison) or (RA) < UI or (RB) 
(unsigned comparison). For floating-point 
Compare instructions, (FRA) < (FRB). 
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1 Greater Than, Floating-Point Greater Than (GT, 
FG) 
For fixed-point Compare instructions, (RA) > SI 
or (RB) (signed comparison) or (RA) * UI or (RB) 
(unsigned comparison). For floating-point 
Compare instructions, (FRA) > (FRB). 


2 Equal, Floating-Point Equal (EQ, FE) 
For fixed-point Compare instructions, (RA) = SI, 
Ul, or (RB). For floating-point Compare 
instructions, (FRA) = (FRB). 


3 Summary Overflow, Floating-Point Unordered 
(SO, FU) 
For fixed-point Compare instructions, this is a 
copy of the final state of XERso at the completion 
of the instruction. For floating-point Compare 
instructions, one or both of (FRA) and (FRB) is a 
NaN. 


2.3.2 Link Register 


The Link Register (LR) is a 64-bit register. It can be 
used to provide the branch target address for the 
Branch Conditional to Link Register instruction, and it 
holds the return address after Branch instructions for 
which LK=1. 


| LR 
0 63 


Figure 19. Link Register 


2.3.3 Count Register 


The Count Register (CTR) is a 64-bit register. It can 
be used to hold a loop count that can be decremented 
during execution of Branch instructions that contain 
an appropriately coded BO field. If the value in the 
Count Register is O before being decremented, it is 
— 1 afterward. The Count Register can also be used 
to provide the branch target address for the Branch 
Conditional to Count Register instruction. 


CTR 
0 63 


Figure 20. Count Register 
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2.4 Branch Processor Instructions 


2.4.1 Branch Instructions 


The sequence of instruction execution can be changed 
by the Branch instructions. Because all instructions 
are on word boundaries, bits 62 and 63 of the gener- 
ated branch target address are ignored by the 
processor in performing the branch. 


The Branch instructions compute the effective 
address (EA) of the target in one of the following four 
ways, as described in Section 1.12.2, “Effective 
Address Calculation” on page 15. 


1. Adding a displacement to the address of the 
Branch instruction (Branch or Branch Conditional 
with AA=0). 


2. Specifying an absolute address (Branch or 
Branch Conditional with AA=1). 


3. Using the address contained in the Link Register 
(Branch Conditional to Link Register). 


4. Using the address contained in the Count Reg- 
ister (Branch Conditional to Count Register). 


In all four cases, in 32-bit mode the final step in the 
address computation is setting the high-order 32 bits 
of the target address to 0. 


For the first two methods, the target addresses can 
be computed sufficiently ahead of the Branch instruc- 
tion that instructions can be prefetched along the 
target path. For the third and fourth methods, pre- 
fetching instructions along the target path is also pos- 
sible provided the Link Register or the Count Register 
is loaded sufficiently ahead of the Branch instruction. 


Branching can be conditional or unconditional, and 
the return address can optionally be provided. If the 
return address is to be provided (LK=1), the effective 
address of the instruction following the Branch 
instruction is placed into the Link Register after the 
branch target address has been computed; this is 
done regardless of whether the branch is taken. 


For Branch Conditional instructions, the BO field 
specifies the conditions under which the branch is 
taken, as shown in Figure 21. In the figure, M=0 in 
64-bit mode and M=32 in 32-bit mode. If the BO field 
specifies that the CTR is to be decremented, the 
entire 64-bit CTR is decremented regardless of the 
mode. 


Decrement the CTR, then branch if the 
decremented CTRiy.g3# 0 and CRg¡=0 
Decrement the CTR, then branch if the 


decremented CTRy.g3= 0 and CRg¡= 0 
Branch if CRg¡= 0 


Decrement the CTR, then branch if the 
decremented CTRy.g3= 0 and CRg;= 1 


Branch if CRg;= 1 


Branch always 


ZA > denotes a bit that is ignored. 
5 The “a” and “t” bits are used as described 
below. 


Figure 21. BO field encodings 


The “a” and “t” bits of the BO field can be used by 
software to ee a hint about whether the branch 
is likely to be taken or is likely not to be taken, as 
shown in Figure 22. 


at | Hint 


Few — — — — — — 


Figure 22. 


"at" bit encodings 


Programming Note 


Many implementations have dynamic mechanisms 
for predicting whether a branch will be taken. 
Because the dynamic prediction is likely to be 
very accurate, and is likely to be overridden by 
any hint provided by the "at" bits, the “at” bits 
should be set to 0b00 unless the static prediction 
implied by at=0b10 or at=0b11 is highly likely to 
be correct. 
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For Branch Conditional to Link Register and Branch 
Conditional to Count Register instructions, the BH 
field provides a hint about the use of the instruction, 
as shown in Figure 28. 


The instruction is a subroutine return 

becir{ I]: The instruction is not a subroutine 
return; the target address is likely to 
be the same as the target address 
used the preceding time the branch 
was taken 


The instruction is not a subroutine 


return; the target address is likely to 
be the same as the target address 
used the preceding time the branch 
was taken 

bcctr[ I]: Reserved 


11 | belr[ I] and bectr[ I]: The target address is not 
predictable 


Figure 23. BH field encodings 


Programming Note 


The hint provided by the BH field is independent 
of the hint provided by the “at” bits (e.g., the BH 
field provides no indication of whether the branch 
is likely to be taken). 


Extended mnemonics for branches 


Many extended mnemonics are provided so that 
Branch Conditional instructions can be coded with 
portions of the BO and Bl fields as part of the mne- 
monic rather than as part of a numeric operand. 
Some of these are shown as examples with the 
Branch instructions. See Appendix B, “Assembler 
Extended Mnemonics” on page 143 for additional 
extended mnemonics. 


Programming Note 


The hints provided by the “at” bits and by the BH 
field do not affect the results of executing the 
instruction. 


The “z” bits should be set to 0, as they may be 
assigned a meaning in some future version of the 
architecture. 


Programming Note 


Many implementations have dynamic mechanisms 
for predicting the target addresses of bclr[I] and 
bcctr[I] instructions. These mechanisms may 
cache return addresses (i.e., Link Register values 
set by Branch instructions for which LK=1 and for 
which the branch was taken) and recently used 
branch target addresses. To obtain the best per- 
formance across the widest range of implementa- 
tions, the programmer should obey the following 
rules. 


m Use Branch instructions for which LK=1 only 
as subroutine calls (including function calls, 
etc.). 

m Pair each subroutine call (i.e., each Branch 
instruction for which LK=1 and the branch is 
taken) with a belr instruction that returns from 
the subroutine and has BH=0b00. 

m Do not use belrl as a subroutine call. (Some 
implementations access the return address 
cache at most once per instruction; such 
implementations are likely to treat bclrl as a 
subroutine return, and not as a subroutine 
call.) 

m For bclr(l] and bcctr[I], use the appropriate 
value in the BH field. 


The following are examples of programming con- 
ventions that obey these rules. In the examples, 
BH is assumed to contain 0b00 unless otherwise 
stated. In addition, the “at” bits are assumed to 
be coded appropriately. 


Let A, B, and Glue be specific programs. 


m Loop counts: 
Keep them in the Count Register, and use a 
be instruction (LK=0) to decrement the count 
and to branch back to the beginning of the 
loop if the decremented count is nonzero. 


= Computed goto's, case statements, etc.: 
Use the Count Register to hold the address to 
branch to, and use a bectr instruction (LK=0, 
and BH=0b11 if appropriate) to branch to the 
selected address. 


m Direct subroutine linkage: 


Here A calls B and B returns to A. The two 
branches should be as follows. 


— A calls B: use a bl or bel instruction 
(LK=1). 

— B returns to A: use a bclr instruction 
(LK=0) (the return address is in, or can 
be restored to, the Link Register). 


m Indirect subroutine linkage: 

Here A calls Glue, Glue calls B, and B returns 
to A rather than to Glue. (Such a calling 
sequence is common in linkage code used 
when the subroutine that the programmer 
wants to call, here B, is in a different module 
from the caller; the Binder inserts “glue” 
code to mediate the branch.) The three 
branches should be as follows. 


— A calls Glue: use a bl or bel instruction 
(LK=1). 


(Programming Note continues in next column....) 
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Programming Note (continued) 


— Glue calls B: place the address of B into 
the Count Register, and use a bcctr 
instruction (LK=0). 

— B returns to A: use a belr instruction 
(LK=0) (the return address is in, or can 
be restored to, the Link Register). 


m Function call: 

Here A calls a function, the identity of which 
may vary from one instance of the call to 
another, instead of calling a specific program 
B. This case should be handled using the 
conventions of the preceding two bullets, 
depending on whether the call is direct or 
indirect, with the following differences. 


— If the call is direct, place the address of 
the function into the Count Register, and 
use a bectrl instruction (LK=1) instead of 
a bl or bel instruction. 

— For the bcctr[I] instruction that branches 
to the function, use BH=0b11 if appro- 
priate. 


Compatibility Note 
The bits corresponding to the current “a” and “t” 


bits, and to the current “z” bits except in the 
“branch always" BO encoding, had different 
meanings in versions of the architecture that 


precede Version 2.00. 


m The bit corresponding to the "t" bit was called 
the “y” bit. The “y” bit indicated whether to 
use the architected default prediction (y=0) 
or to use the complement of the default pre- 
diction (y=1). The default prediction was 
defined as follows. 


— |f the instruction is bc[ I][a] with a nega- 
tive value in the displacement field, the 
branch is taken. (This is the only case in 
which the prediction corresponding to the 
“y” bit differs from the prediction corre- 
sponding to the “t” bit.) 


— In all other cases (be[/|[a] with a non- 
negative value in the displacement field, 
bclr[I], or bectr[ fl), the branch is not 
taken. 


m The BO encodings that test both the Count 
Register and the Condition Register had a “y” 
bit in place of the current “z” bit. The 
meaning of the “y” bit was as described in 
the preceding item. 


m The “a” bit was a “z” bit. 


Because these bits have always been defined 
either to be ignored or to be treated as hints, a 
given program will produce the same result on 
any implementation regardless of the values of 
the bits. Also, because even the “y” bit is 
ignored, in practice, by most processors that 
implement versions of the architecture that 
precede Version 2.00, the performance of a given 
program on those processors will not be affected 
by the values of the bits. 


Architecture Note 


In some future version of the architecture, the 
value at=0b01 may be used to indicate that the 
branch path (taken or not taken) is unpredictable 
(i.e., that neither static nor dynamic prediction is 
likely to predict the path accurately). It is 
expected that any new meaning will be such that 
future Branch Conditional instructions that use 
at=0b01 would use at=0b00 in the current archi- 
tecture. 


Decisions regarding assignment of a meaning for 
at=0b01 must include consideration of the extent 
to which software still uses the earlier meaning 
(see the preceding Compatibility Note), and of the 
effect that the new meaning would have on the 
performance of such software. 


Decisions regarding assignment of a meaning for 
bit 16 of bclr[I] and bcctr[ I] instructions in some 
future version of the architecture (e.g., to extend 
the BH field) must include consideration of the 
fact that processors that implement versions of 
the architecture that precede Version 2.00 may 
use the bit in computing the prediction associated 
with the “y” bit. Specifically, for all three Branch 
Conditional instructions, such processors may 
predict that the branch will be taken if the value 
of the following expression is 1, and will not be 
taken if the value is 0. *s" represents bit 16 of the 
instruction. 


(BO, & BO») | (s € BO,) 


The expression assumes that instruction bit 16, 
which is the sign bit of the displacement field for 
bce[I][ a], contains 0 for bclr[1] and bcctr[ I]. 
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Branch l-form 


Branch Conditional B-form 


b target addr (AA=0 LK=0) 
ba target addr (AA=1 LK=0) 
bl target_addr (AA=0 LK=1) 
bla target_addr (AA=1 LK=1) 

18 LI AA|LK 
0 6 30 | 31 


if AA then NIA €&,, EXTS(LI | 0b00) 
else NIA €., CIA + EXTS(LI | 0b00) 
if LK then LR €,, CIA + 4 


target addr specifies the branch target address. 


If AA=0 then the branch target address is the sum of 
LI | 0b00 sign-extended and the address of this 
instruction, with the high-order 32 bits of the branch 
target address set to 0 in 32-bit mode. 


If AA=1 then the branch target address is the value 
LI | 0b00 sign-extended, with the high-order 32 bits of 
the branch target address set to 0 in 32-bit mode. 


If LK=1 then the effective address of the instruction 
following the Branch instruction is placed into the Link 
Register. 


Special Registers Altered: 
LR (if LK=1) 


bc BO,Bl,target_addr (AA=0 LK=0) 
bca BO,Bl,target_addr (AA=1 LK=0) 
bel BO,Bl,target_addr (AA=0 LK=1) 
bcla BO,Bl,target_addr (AA=1 LK=1) 


16 BO Bl BD AA|LK 
0 6 11 16 30 | 31 


if (64-bit mode) 


then M € 0 
else M € 32 
if BO, then CTR € CTR — 1 
ctr ok € BO, | ((CTRy63 # 0) O BOs) 


cond ok € BOg | (CRg, = B0) 

if ctr ok & cond ok then 

if AA then NIA €., EXTS(BD | 0500) 

else NIA €.. CIA + EXTS(BD | 0b00) 
if LK then LR €, CIA + 4 


The BI field specifies the Condition Register bit to be 
tested. The BO field is used to resolve the branch as 
described in Figure 21. target addr specifies the 
branch target address. 


If AA=0 then the branch target address is the sum of 
BD | 0b00 sign-extended and the address of this 
instruction, with the high-order 32 bits of the branch 
target address set to 0 in 32-bit mode. 


If AA=1 then the branch target address is the value 
BD | 0b00 sign-extended, with the high-order 32 bits 
of the branch target address set to 0 in 32-bit mode. 


If LK=1 then the effective address of the instruction 
following the Branch instruction is placed into the Link 
Register. 


Special Registers Altered: 
CTR (if BO,=0) 
LR (if LK=1) 


Extended Mnemonics: 


Examples of extended mnemonics for Branch Condi- 
tional: 


Extended: Equivalent to: 

bit target bc 12,0,target 
bne cr2,target bc 4,10,target 
bdnz target bc 16,0,target 
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Branch Conditional to Link Register 
XL-form 


belr BO,BI,BH (LK=0) 
belrl BO,BI,BH (LK=1) 


[POWER mnemonics: bcr, bcrl] 


Bl 11/ BH 16 LK 


if (64-bit mode) 

then M ¢ 0 

else M € 32 

if BO, then CTR € CTR - 1 

ctr ok € BOs | ((CTRm:63 + 0) O BOs) 

cond ok € BOg | (CRg, = B04) 

if ctr ok & cond ok then NIA €.. LRoey | 0b00 
if LK then LR €., CIA + 4 


The BI field specifies the Condition Register bit to be 
tested. The BO field is used to resolve the branch as 
described in Figure 21. The BH field is used as 
described in Figure 23. The branch target address is 
LRo:s1 | 0b00, with the high-order 32 bits of the branch 
target address set to 0 in 32-bit mode. 


If LK=1 then the effective address of the instruction 
following the Branch instruction is placed into the Link 
Register. 


Special Registers Altered: 
CTR (if BO22 0) 
LR (if LK=1) 


Extended Mnemonics: 


Examples of extended mnemonics for Branch Condi- 
tional to Link Register: 


Extended: Equivalent to: 
belr 4,6 bclr 4,6,0 

bltlr belr 12,0,0 
bnelr  cr2 belr 4,10,0 
bdnzlr belr 16,0,0 


Programming Note 


belr, bclrl, bcctr, and bcctrl each serve as both a 
basic and an extended mnemonic. The Assembler 
will recognize a bclr, bclrl, bectr, or bectr! mne- 
monic with three operands as the basic form, and 
a belr, belrl, bectr, or bcctrl mnemonic with two 
operands as the extended form. In the extended 
form the BH operand is omitted and assumed to 
be 0b00. 


Branch Conditional to Count Register 
XL-form 


bectr BO,BI,BH (LK=0) 
bcctrl BO,BI,BH (LK=1) 


[POWER mnemonics: bcc, becl] 


cond ok € BOg | (CRg, = B04) 
if cond ok then NIA 4., CTRog, | 0b00 
if LK then LR €, CIA + 4 


The BI field specifies the Condition Register bit to be 
tested. The BO field is used to resolve the branch as 
described in Figure 21. The BH field is used as 
described in Figure 23. The branch target address is 
CTRog, | 0b00, with the high-order 32 bits of the 
branch target address set to 0 in 32-bit mode. 


If LK21 then the effective address of the instruction 
following the Branch instruction is placed into the Link 
Register. 


If the “decrement and test CTR" option is specified 
(BO»=0), the instruction form is invalid. 


Special Registers Altered: 
LR (if LK=1) 
Extended Mnemonics: 


Examples of extended mnemonics for Branch Condi- 
tional to Count Register: 


Extended: Equivalent to: 
bectr 4,6 bcctr 4,6,0 

bltctr bcctr 12,0,0 
bnectr cr2 bcctr | 4,10,0 
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2.4.2 System Call Instruction 


This instruction provides the means by which a 
program can call upon the system to perform a 
service. 


System Call SC-form 


sc LEV 
[POWER mnemonic: svca] 


This instruction calls the system to perform a service. 
A complete description of this instruction can be 
found in Book Ill, PowerPC AS Operating Environment 
Architecture. 


The use of the LEV field is described in Book Ill. The 
contents of the LEV field must be 0 or 1; otherwise the 
results are boundedly undefined. 


When control is returned to the program that exe- 
cuted the System Call instruction, the contents of the 
registers will depend on the register conventions used 
by the program providing the system service. 


This instruction is context synchronizing (see Book III, 
PowerPC AS Operating Environment Architecture). 


Special Registers Altered: 
Dependent on the system service 


Programming Note 


sc serves as both a basic and an extended mne- 
monic. The Assembler will recognize an sc mne- 
monic with one operand as the basic form, and an 
sc mnemonic with no operand as the extended 
form. In the extended form the LEV operand is 
omitted and assumed to be 0. 


In application programs the value of the LEV 
operand for se should be 0. 


Compatibility Note 


For a discussion of POWER compatibility with 
respect to instruction bits 16:19 and 27:29, see 
Appendix E, “Incompatibilities with the POWER 
Architecture” on page 163. For compatibility with 
future versions of the PowerPC AS Architecture, 
these bits should be coded as zeros. 
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2.4.3 Condition Register Logical Instructions 


The Condition Register Logical instructions have pre- 
ferred forms; see Section 1.9.1, “Preferred Instruction 
Forms” on page 12. In the preferred forms, the BT 
and BB fields satisfy the following rule. 


m The bit specified by BT is in the same Condition 
Register field as the bit specified by BB. 


Condition Register AND XL-form 


crand BT,BA,BB 


CRgr € CRga € CRgg 


The bit in the Condition Register specified by BA is 
ANDed with the bit in the Condition Register specified 
by BB, and the result is placed into the bit in the Con- 
dition Register specified by BT. 


Special Registers Altered: 
R gr 


Condition Register XOR XL-form 


crxor BT,BA,BB 


The bit in the Condition Register specified by BA is 
XORed with the bit in the Condition Register specified 
by BB, and the result is placed into the bit in the Con- 
dition Register specified by BT. 


Special Registers Altered: 
R gT 


Extended Mnemonics: 


Example of extended mnemonics for Condition Reg- 


ister XOR: 
Extended: Equivalent to: 
crclr Bx crxor Bx,Bx,Bx 


Extended mnemonics for Condition 
Register logical operations 


A set of extended mnemonics is provided that allow 
additional Condition Register logical operations, 
beyond those provided by the basic Condition Reg- 
ister Logical instructions, to be coded easily. Some of 
these are shown as examples with the Condition Reg- 
ister Logical instructions. See Appendix B, “Assem- 
bler Extended Mnemonics” on page 143 for additional 
extended mnemonics. 


Condition Register OR XL-form 


cror BT,BA,BB 


CRgr € CRga | CRgg 


The bit in the Condition Register specified by BA is 
ORed with the bit in the Condition Register specified 
by BB, and the result is placed into the bit in the Con- 
dition Register specified by BT. 


Special Registers Altered: 
R gT 


Extended Mnemonics: 


Example of extended mnemonics for Condition Reg- 
ister OR: 


Extended: 
crmove Bx,By cror 


Condition Register NAND XL-form 


Equivalent to: 
Bx,By,By 


crnand BT,BA,BB 


CRgT + ^ (CRBA & CRep) 


The bit in the Condition Register specified by BA is 
ANDed with the bit in the Condition Register specified 
by BB, and the complemented result is placed into the 
bit in the Condition Register specified by BT. 


Special Registers Altered: 
R gr 
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Condition Register NOR XL-form 


crnor BT,BA,BB 


19 BT BA BB 33 / 
0 6 11 16 21 31 


CRgr €(CRga | CRgg) 


The bit in the Condition Register specified by BA is 
ORed with the bit in the Condition Register specified 
by BB, and the complemented result is placed into the 
bit in the Condition Register specified by BT. 


Special Registers Altered: 
CR pr 


Extended Mnemonics: 


Example of extended mnemonics for Condition Reg- 


ister NOR: 
Extended: Equivalent to: 
crnot Bx,By crnor Bx,By,By 


Condition Register AND with 
Complement XL-form 


crandc BT,BA,BB 


19 BT BA BB 129 / 
0 6 11 16 21 31 


CRBT ¢ CRBA & 7CRep 


The bit in the Condition Register specified by BA is 
ANDed with the complement of the bit in the Condi- 
tion Register specified by BB, and the result is placed 
into the bit in the Condition Register specified by BT. 


Special Registers Altered: 
CR gr 


Condition Register Equivalent XL-form 


creqv BT,BA,BB 


19 BT BA BB 289 / 
0 6 11 16 21 31 


CRBT € CRea = CRpp 


The bit in the Condition Register specified by BA is 
XORed with the bit in the Condition Register specified 
by BB, and the complemented result is placed into the 
bit in the Condition Register specified by BT. 


Special Registers Altered: 
CR gr 
Extended Mnemonics: 
Example of extended mnemonics for Condition Reg- 
ister Equivalent: 


Extended: 
crset Bx 


Equivalent to: 
cregv Bx,Bx,Bx 


Condition Register OR with Complement 
XL-form 


crorc BT,BA,BB 


19 BT BA BB 417 / 
0 6 11 16 21 31 


CRBT € CRea | 7CRep 


The bit in the Condition Register specified by BA is 
ORed with the complement of the bit in the Condition 
Register specified by BB, and the result is placed into 
the bit in the Condition Register specified by BT. 


Special Registers Altered: 
CR gr 
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2.4.4 Condition Register Field 
Instruction 


Move Condition Register Field XL-form 


merf BF,BFA 


CR4xBF:4xBF+3 € CRaxBra:4xBFA+3 


The contents of Condition Register field BFA are 
copied to Condition Register field BF. 


Special Registers Altered: 
CR field BF 
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Chapter 3. Fixed-Point Processor 


3.1 Fixed-Point Processor Overview .. 29 
3.2 Fixed-Point Processor Registers . . 29 
3.2.1 General Purpose Registers .... 29 


3.2.2 Fixed-Point Exception Register . 30 
3.3 Fixed-Point Processor Instructions 31 
3.3.1 Fixed-Point Storage Access 


Instructions ................ 31 
3.3.1.1 Storage Access Exceptions ... 31 
3.3.2 Fixed-Point Load Instructions .. 31 
3.3.3 Fixed-Point Store Instructions .. 38 
3.3.4 Fixed-Point Load and Store with 

Byte Reversal Instructions ....... 42 
3.3.5 Fixed-Point Load and Store 

Multiple Instructions ........... 44 


3.1 Fixed-Point Processor Overview 


This chapter describes the registers and instructions 
that make up the Fixed-Point Processor facility. 
Section 3.2, “Fixed-Point Processor Registers” 
describes the registers associated with the Fixed- 
Point Processor. Section 3.3, “Fixed-Point Processor 
Instructions” on page 31 describes the instructions 
associated with the Fixed-Point Processor. 


3.2 Fixed-Point Processor 
Registers 


3.3.6 Fixed-Point Move Assist 
Instructions 
3.3.7 Other Fixed-Point Instructions .. 48 
3.3.8 Fixed-Point Arithmetic Instructions 49 
3.3.9 Fixed-Point Compare Instructions 58 
3.3.10 Fixed-Point Trap Instructions .. 60 
3.3.11 Fixed-Point Logical Instructions 62 
3.3.12 Fixed-Point Rotate and Shift 
Instructions 
3.3.12.1 Fixed-Point Rotate Instructions 68 
3.3.12.2 Fixed-Point Shift Instructions . 74 
3.3.13 Move To/From System Register 
Instructions 


3.2.1 General Purpose Registers 


All manipulation of information is done in registers 
internal to the Fixed-Point Processor. The principal 
storage internal to the Fixed-Point Processor is a set 
of 32 General Purpose Registers (GPRs). See 
Figure 24. 


Figure 24. General Purpose Registers 


Each GPR is a 64-bit register. 
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3.2.2 Fixed-Point Exception Register 


The Fixed-Point Exception Register (XER) is a 64-bit 
register. 


XER 
0 63 


Figure 25. Fixed-Point Exception Register 


The bit definitions for the Fixed-Point Exception Reg- 
ister are shown below. Here M=0 in 64-bit mode and 
M=32 in 32-bit mode. 


The bits are set based on the operation of an instruc- 
tion considered as a whole, not on intermediate 
results (e.g., the Subtract From Carrying instruction, 
the result of which is specified as the sum of three 
values, sets bits in the Fixed-Point Exception Register 
based on the entire operation, not on an intermediate 
sum). 


Bit(s) Description 
0:31 Reserved 


32 Summary Overflow (SO) 

The Summary Overflow bit is set to 1 when- 
ever an instruction (except mtspr) sets the 
Overflow bit. Once set, the SO bit remains 
set until it is cleared by an mtspr instruction 
(specifying the XER) or an merxr instruction. 
It is not altered by Compare instructions, nor 
by other instructions (except mtspr to the 
XER, and merxr) that cannot overflow. Exe- 
cuting an mtspr instruction to the XER, sup- 
plying the values O for SO and 1 for OV, 
causes SO to be set to 0 and OV to be set to 
1. 


33 Overflow (OV) 

The Overflow bit is set to indicate that an 
overflow has occurred during execution of an 
instruction. XO-form Add, Subtract From, and 
Negate instructions having OE=1 set it to 1 if 
the carry out of bit M is not equal to the 
carry out of bit M+1, and set it to O other- 
wise.  XO-form Multiply Low and Divide 
instructions having OE=1 set it to 1 if the 
result cannot be represented in 64 bits 
(mulld, divd, divdu) or in 32 bits (mullw, divw, 
divwu), and set it to 0 otherwise. The OV bit 
is not altered by Compare instructions, nor 
by other instructions (except mtspr to the 
XER, and merxr) that cannot overflow. 


34 Carry (CA) 

The Carry bit is set as follows, during exe- 
cution of certain instructions. Add Carrying, 
Subtract From Carrying, Add Extended, and 
Subtract From Extended instructions set it to 
1 if there is a carry out of bit M, and set it to 
O otherwise. Shift Right Algebraic 
instructions set it to 1 if any 1-bits have been 
shifted out of a negative operand, and set it 
to 0 otherwise. The CA bit is not altered by 
Compare instructions, nor by other 
instructions (except Shift Right Algebraic, 
mtspr to the XER, and merxr) that cannot 
carry. 


35:56 Reserved 


57:63 This field specifies the number of bytes to be 
transferred by a Load String Indexed or Store 
String Indexed instruction. 


Compatibility Note 


For a discussion of POWER compatibility with 
respect to XER bits 48:55, see Appendix E, 
"Incompatibilities with the POWER Architecture" 
on page 163. For compatibility with future ver- 
sions of the PowerPC AS Architecture, these bits 
should be set to zero. 
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3.3 Fixed-Point Processor Instructions 


3.3.1 Fixed-Point Storage Access Instructions 


The Storage Access instructions compute the effective 
address (EA) of the storage to be accessed as 
described in Section 1.12.2, “Effective Address 
Calculation” on page 15. 


Programming Note 


The la extended mnemonic permits computing an 
effective address as a Load or Store instruction 
would, but loads the address itself into a GPR 
rather than loading the value that is in storage at 
that address. This extended mnemonic is 
described in Section  B.9, “Miscellaneous 
Mnemonics” on page 153. 


3.3.2 Fixed-Point Load Instructions 


The byte, halfword, word, or doubleword in storage 
addressed by EA is loaded into register RT. 


Many of the Load instructions have an “update” form, 
in which register RA is updated with the effective 
address. For these forms, if RA#0 and RA+ RT, the 
effective address is placed into register RA and the 
storage element (byte, halfword, word, or doubleword) 
addressed by EA is loaded into RT. 


Programming Note 


The DS field in DS-form Storage Access 
instructions is a word offset, not a byte offset like 
the D field in D-form Storage Access instructions. 
However, for programming convenience, Assem- 
blers should support the specification of byte 
offsets for both forms of instruction. 


3.3.1.1 Storage Access Exceptions 


Storage accesses will cause the system data storage 
error handler to be invoked if the program is not 
allowed to modify the target storage (Store only), or if 
the program attempts to access storage that is una- 
vailable. 


Programming Note 


In some implementations, the Load Algebraic and 
Load with Update instructions may have greater 
latency than other types of Load instructions. 
Moreover, Load with Update instructions may take 
longer to execute in some implementations than 
the corresponding pair of a non-update Load 
instruction and an Add instruction. 
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Load Byte and Zero D-form 


bz RT,D(RA) 


"Li 0 
else « (RA) 
EA € b + EXTS(D : 

RT € 560 | MEM(EA, 1) 


Let the effective address (EA) be the sum (RA|0)+D. 
The byte in storage addressed by EA is loaded into 
RTs6.63- RTo-55 are set to 0. 


Special Registers Altered: 
None 


Load Byte and Zero with Update 
D-form 


lbzu RT,D(RA) 


EA € (RA) + EXTS(D) 
RT € 560 | MEM(EA, 1) 
RA 4 EA 


Let the effective address (EA) be the sum (RA)+D. 
The byte in storage addressed by EA is loaded into 
RTs6:63- RToss are set to 0. 

EA is placed into register RA. 

If RA=0 or RA=RT, the instruction form is invalid. 


Special Registers Altered: 
None 
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Load Byte and Zero Indexed X-form 


Ibzx RT,RA,RB 


31 RT RA RB 87 / 
0 6 11 16 21 31 


AE 0 
else € (RA) 
EA € b + (RB) 

RT € 590 | MEM(EA, 1) 


Let the effective address (EA) be the sum 
(RAJO)+(RB). The byte in storage addressed by EA is 
loaded into RTsg.63. RToss are set to 0. 


Special Registers Altered: 
None 


Load Byte and Zero with Update 
Indexed X-form 


Ibzux RT,RA,RB 


31 RT RA RB 119 / 
0 6 11 16 21 31 


EA € (RA) + (RB) 
RT 450 | MEM(EA, 1) 
RA € EA 


Let the effective address (EA) be the sum (RA)+(RB). 
The byte in storage addressed by EA is loaded into 
RT56.63- RToss are set to 0. 

EA is placed into register RA. 

If RA=0 or RA=RT, the instruction form is invalid. 


Special Registers Altered: 
None 
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Load Halfword and Zero D-form 


Ihz RT,D(RA) 


40 RT RA D 
0 6 11 16 31 


if RA = 0 then b €0 
else b € (RA) 
EA € b + EXTS(D) 

RT € 480 | MEM(EA, 2) 


Let the effective address (EA) be the sum (RA|0)+D. 
The halfword in storage addressed by EA is loaded 
into RT4g.63. RTo:47 are set to 0. 


Special Registers Altered: 
None 


Load Halfword and Zero with Update 
D-form 


Ihzu RT,D(RA) 


EA € (RA) + EXTS(D) 
RT € 480 | MEM(EA, 2) 
RA € EA 


Let the effective address (EA) be the sum (RA)+D. 
The halfword in storage addressed by EA is loaded 
into RT4g.63. RTo:47 are set to 0. 

EA is placed into register RA. 

If RA=0 or RA=RT, the instruction form is invalid. 


Special Registers Altered: 
None 


Load Halfword and Zero Indexed 
X-form 


Ihzx RT,RA,RB 


31 RT RA RB 219 / 
0 6 11 16 21 31 


if RA = 0 then b €0 
else b € (RA) 
EA € b + (RB) 

RT «€ 480 | MEM(EA, 2) 


Let the effective address (EA) be the sum 
(RAJO)+(RB). The halfword in storage addressed by 
EA is loaded into RT 48-63: RT 9-47 are set to 0. 


Special Registers Altered: 
None 


Load Halfword and Zero with Update 
Indexed X-form 


lhzux RT,RA,RB 


EA € (RA) + (RB) 
RT € ^90 | MEM(EA, 2) 
RA € EA 


Let the effective address (EA) be the sum (RA)+(RB). 
The halfword in storage addressed by EA is loaded 
into RTag:63- RTo:47 are set to 0. 

EA is placed into register RA. 

If RA=0 or RA=RT, the instruction form is invalid. 


Special Registers Altered: 
None 
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Load Halfword Algebraic D-form 


ha RT,D(RA) 


42 RT RA D 
0 6 11 16 31 


if RA = 0 then b €0 
else b € (RA) 
EA € b + EXTS(D) 

RT € EXTS(MEM(EA, 2)) 


Let the effective address (EA) be the sum (RA|0)+D. 
The halfword in storage addressed by EA is loaded 
into RT 49.63. RToa4z are filled with a copy of bit O of 
the loaded halfword. 


Special Registers Altered: 
None 


Load Halfword Algebraic with Update 
D-form 


Ihau RT,D(RA) 


43 RT RA D 


EA € (RA) + EXTS(D) 
RT € EXTS(MEM(EA, 2)) 
RA € EA 


Let the effective address (EA) be the sum (RA)+D. 
The halfword in storage addressed by EA is loaded 
into RT4g.63- RTo:47 are filled with a copy of bit O of 
the loaded halfword. 


EA is placed into register RA. 
If RA=0 or RA=RT, the instruction form is invalid. 


Special Registers Altered: 
None 
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Load Halfword Algebraic Indexed 
X-form 


lhax RT,RA,RB 


31 RT RA RB 343 / 
0 6 11 16 21 31 


if RA = 0 then b €0 
else b € (RA) 
EA € b + (RB) 

RT € EXTS(MEM(EA, 2)) 


Let the effective address (EA) be the sum 
(RAJO)+(RB). The halfword in storage addressed by 
EA is loaded into RT¿g.43- RTo:47 are filled with a copy 
of bit O of the loaded halfword. 


Special Registers Altered: 
None 


Load Halfword Algebraic with Update 
Indexed X-form 


lhaux RT,RA,RB 


31 RT RA RB 375 / 


S 
o 
zt 
Em 
o 
N 
r 
w 
= 


EA € (RA) + (RB) 
RT € EXTS(MEM(EA, 2)) 
RA € EA 


Let the effective address (EA) be the sum (RA)+(RB). 
The halfword in storage addressed by EA is loaded 
into RT4g.63. RTo.47 are filled with a copy of bit O of 
the loaded halfword. 


EA is placed into register RA. 
If RA=0 or RA=RT, the instruction form is invalid. 


Special Registers Altered: 
None 
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Load Word and Zero D-form 


Iwz RT,D(RA) 
[POWER mnemonic: I] 


32 RT RA D 
0 6 11 16 31 


if RA = 0 then b €0 
else b € (RA) 
EA € b + EXTS(D) 

RT € 32) | MEM(EA, 4) 


Let the effective address (EA) be the sum (RA|0)+D. 
The word in storage addressed by EA is loaded into 
RT39.63- RTo:3 are set to 0. 


Special Registers Altered: 
None 


Load Word and Zero with Update 
D-form 


Iwzu RT,D(RA) 
[POWER mnemonic: lu] 


33 RT RA D 
0 6 11 16 31 


EA € (RA) + EXTS(D) 
RT ¢ 320 | MEM(EA, 4) 
RA € EA 


Let the effective address (EA) be the sum (RA)+D. 
The word in storage addressed by EA is loaded into 
RT32.53. RTog3 are set to 0. 

EA is placed into register RA. 

If RA=0 or RA=RT, the instruction form is invalid. 


Special Registers Altered: 
None 


Load Word and Zero Indexed X-form 


Iwzx RT,RA,RB 
[POWER mnemonic: Ix] 


31 RT RA RB 23 / 
0 6 11 16 21 31 


if RA = 0 then b €0 
else b € (RA) 
EA € b + (RB) 

RT € 320 | MEM(EA, 4) 


Let the effective address (EA) be the sum 
(RAJO)+(RB). The word in storage addressed by EA 
is loaded into RT42.53. RT9.4 are set to 0. 


Special Registers Altered: 
None 


Load Word and Zero with Update 
Indexed X-form 


Iwzux RT,RA,RB 
[POWER mnemonic: lux] 


31 RT RA RB 55 / 
0 6 11 16 21 31 


EA € (RA) + (RB) 
RT ¢ 320 | MEM(EA, 4) 
RA 4 EA 


Let the effective address (EA) be the sum (RA)+(RB). 
The word in storage addressed by EA is loaded into 
RT32:63- RTo:3/ are set to 0. 

EA is placed into register RA. 

If RA=0 or RA=RT, the instruction form is invalid. 


Special Registers Altered: 
None 
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Load Word Algebraic DS-form 


wa RT,DS(RA) 


58 RT RA DS 2 
0 6 11 16 30 31 


if RA = 0 then b €0 
else b € (RA) 
EA € b + EXTS(DS | 0b00) 
RT € EXTS(MEM(EA, 4)) 


Let the effective address (EA) be the sum 
(RA|0)+(DS |0b00). The word in storage addressed by 
EA is loaded into RT39.¢3._ RTo:3 are filled with a copy 
of bit O of the loaded word. 


Special Registers Altered: 
None 
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Load Word Algebraic Indexed X-form 


Iwax RT,RA,RB 


31 RT RA RB 341 / 
0 6 11 16 21 31 


if RA = 0 then b €0 
else b € (RA) 
EA ¢ b + (RB) 

RT € EXTS(MEM(EA, 4)) 


Let the effective address (EA) be the sum 
(RAJO)-(RB). The word in storage addressed by EA 
is loaded into RTz>g3. RTo44 are filled with a copy of 
bit O of the loaded word. 


Special Registers Altered: 
None 


Load Word Algebraic with Update 
Indexed X-form 


Iwaux RT,RA,RB 


31 RT RA RB 373 / 
0 6 11 16 21 31 


EA € (RA) + (RB) 
RT € EXTS(MEM(EA, 4)) 
RA € EA 


Let the effective address (EA) be the sum (RA)+(RB). 
The word in storage addressed by EA is loaded into 
RTx32:63- RTo3 are filled with a copy of bit O of the 
loaded word. 


EA is placed into register RA. 
If RA=0 or RA=RT, the instruction form is invalid. 


Special Registers Altered: 
None 
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Load Doubleword DS-form 


ld RT,DS(RA) 


0 
30 31 


if RA = 0 then b €0 
else b € (RA) 
EA € b + EXTS(DS | 0b00) 
RT € MEM(EA, 8) 


Let the effective address (EA) be the sum 
(RA|0)+(DS |0b00). The doubleword in storage 
ie by EA is loaded into RT. 


Special Registers Altered: 
None 


Load Doubleword with Update DS-form 


Idu RT,DS(RA) 


1 
30 31 


EA € (RA) + EXTS(DS | 0b00) 
RT € MEM(EA, 8) 
RA € EA 


Let the effective address (EA) be the sum 
(RA)+(DS |0b00). The doubleword in storage 
addressed by EA is loaded into RT. 

EA is placed into register RA. 

If RA=0 or RA=RT, the instruction form is invalid. 


Special Registers Altered: 
None 


Load Doubleword Indexed X-form 


Idx RT,RA,RB 


31 RT RA RB 21 / 
0 6 11 16 21 31 


EP 0 
else € (RA) 
EA € b + (RB) 

RT € MEM(EA, 8) 


Let the effective address (EA) be the sum 
(RAJO)+(RB). The doubleword in storage addressed 
by EA is loaded into RT. 


Special Registers Altered: 
None 


Load Doubleword with Update Indexed 
X-form 


Idux RT,RA,RB 


31 RT RA RB 53 / 
0 6 11 16 21 31 


EA € (RA) + (RB) 
RT € MEM(EA, 8) 
RA € EA 


Let the effective address (EA) be the sum (RA)+(RB). 
The doubleword in storage addressed by EA is loaded 
into RT. 

EA is placed into register RA. 

If RA=0 or RA=RT, the instruction form is invalid. 


Special Registers Altered: 
None 
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3.3.3 Fixed-Point Store Instructions 


The contents of register RS are stored into the byte, 
halfword, word, or doubleword in storage addressed 
by EA. 


Many of the Store instructions have an “update” form, 
in which register RA is updated with the effective 
address. For these forms, the following rules apply. 


Store Byte D-form 


stb RS,D(RA) 


RS 
31 


MEL 0 
else « (RA) 
EA € b + EXTS(D : 

MEM (EA, 1) € (RS) 56:63 


Let the effective address (EA) be the sum (RA|0)+D. 
(RS)s6:63 are stored into the byte in storage addressed 
by EA. 


Special Registers Altered: 
None 


Store Byte with Update D-form 


stbu RS,D(RA) 


RS 
31 


EA € (RA) + EXTS(D) 
MEM(EA, 1) € (RS) 56:63 
RA € EA 


Let the effective address (EA) be the sum (RA)+D. 
(RS)s6:63 are stored into the byte in storage addressed 


by EA 
EA is placed into register RA. 
If RA=0, the instruction form is invalid. 


Special Registers Altered: 
None 
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m |f RAzO, the effective address is placed into reg- 
ister RA. 


m If RS=RA, the contents of register RS are copied 
to the target storage element and then EA is 
placed into RA (RS). 


Store Byte Indexed X-form 


stbx RS,RA,RB 


31 RS RA RB 215 / 
0 6 11 16 21 31 


MN 0 
else « (RA) 
EA € b + (RB) 

MEM (EA, 1) € (RS) 56:63 


Let the effective address (EA) be the sum 
(RAJO)+(RB). (RS)seeg are stored into the byte in 
storage addressed by EA. 


Special Registers Altered: 
None 


Store Byte with Update Indexed X-form 


stbux RS,RA,RB 


31 RS RA RB 247 / 
0 6 11 16 21 31 


EA € (RA) + (RB) 
MEM(EA, 1) € (RS) 56.63 
RA € EA 


Let the effective address (EA) be the sum (RA)+(RB). 
(RS)s6:63 are stored into the byte in storage addressed 
by EA. 


EA is placed into register RA. 
If RA=0, the instruction form is invalid. 


Special Registers Altered: 
None 


38 PowerPC AS User Instruction Set Architecture, Tags Inactive Subset 


IBM Confidential - Feb. 24, 1999 


Store Halfword D-form 


sth RS,D(RA) 


RS 
31 


oe ee 0 
else € (RA) 
EA € b + EXTS(D : 

MEM (EA, 2) + (RS) 48:63 


Let the effective address (EA) be the sum (RA|0)+D. 
(RS)4g-63 are stored into the halfword in storage 
addressed by EA. 


Special Registers Altered: 
None 


Store Halfword with Update D-form 


sthu RS,D(RA) 


RS 
31 


EA € (RA) + EXTS(D) 
MEM (EA, 2) + (RS) 48:63 
RA € EA 


Let the effective address (EA) be the sum (RA)+D. 
(RS)4g:63 are stored into the halfword in storage 
addressed by EA. 

EA is placed into register RA. 


If RA=0, the instruction form is invalid. 


Special Registers Altered: 
None 


Store Halfword Indexed X-form 


sthx RS,RA,RB 


31 RS RA RB 407 / 
0 6 11 16 21 31 


A 0 
else € (RA) 
EA € b + (RB) 

MEM (EA, 2) + (RS) 48:63 


Let the effective address (EA) be the sum 
(RAJO)+(RB). (RS)4g:63 are stored into the halfword in 
storage addressed by EA. 


Special Registers Altered: 
None 


Store Halfword with Update Indexed 
X-form 


sthux RS,RA,RB 


31 RS RA RB 439 / 
0 6 11 16 21 31 


EA € (RA) + (RB) 
MEM(EA, 2) € (RS) 48:63 
RA € EA 


Let the effective address (EA) be the sum (RA)+(RB). 
(RS)as:6g are stored into the halfword in storage 
addressed by EA. 

EA is placed into register RA. 


If RA=0, the instruction form is invalid. 


Special Registers Altered: 
None 
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Store Word D-form 


stw RS,D(RA) 
[POWER mnemonic: st] 


36 RS RA D 
0 6 11 16 31 


if RA = 0 then b €0 
else b € (RA) 
EA € b + EXTS(D) 
MEM(EA, 4) € (RS) 30:63 


Let the effective address (EA) be the sum (RA|0)+D. 
(RS)s2:63 are stored into the word in storage 
addressed by EA. 


Special Registers Altered: 
None 


Store Word with Update D-form 


stwu RS,D(RA) 
[POWER mnemonic: stu] 


EA € (RA) + EXTS(D) 
MEM(EA, 4) € (RS) 39.63 
RA € EA 


Let the effective address (EA) be the sum (RA)+D. 
(RS)s263 are stored into the word in storage 
addressed by EA. 

EA is placed into register RA. 


If RA=0, the instruction form is invalid. 


Special Registers Altered: 
None 
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Store Word Indexed X-form 


stwx RS,RA,RB 
[POWER mnemonic: stx] 


31 RS RA RB 151 / 
0 6 11 16 21 31 


if RA = 0 then b €0 
else b € (RA) 
EA € b + (RB) 

MEM (EA, 4) € (RS) 30:63 


Let the effective address (EA) be the sum 
(RA|O)+(RB). (RS)30.¢3 are stored into the word in 
storage addressed by EA. 


Special Registers Altered: 
None 


Store Word with Update Indexed X-form 


stwux RS,RA,RB 
[POWER mnemonic: stux] 


EA € (RA) + (RB) 
MEM(EA, 4) € (RS) 39.63 
RA € EA 


Let the effective address (EA) be the sum (RA)+(RB). 
(RS)3263 are stored into the word in storage 
addressed by EA. 

EA is placed into register RA. 


If RA=0, the instruction form is invalid. 


Special Registers Altered: 
None 
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Store Doubleword DS-form 


std RS,DS(RA) 


RS 0 
30 31 


if RA = 0 then b €0 
else b € (RA) 
EA € b + EXTS(DS | 0b00) 
MEM(EA, 8) € (RS) 


Let the effective address (EA) be the sum 
(RA|0)+(DS |0b00). (RS) is stored into the 
vigneti in storage addressed by EA. 


Special Registers Altered: 
None 


Store Doubleword with Update DS-form 


stdu RS,DS(RA) 


RS 1 
30 31 


EA € (RA) + EXTS(DS | 0b00) 
MEM(EA, 8) € (RS) 
RA € EA 


Let the effective address (EA) be the sum 
(RA)+(DS |0b00). (RS) is stored into the doubleword 
in storage addressed by EA. 

EA is placed into register RA. 


If RA=0, the instruction form is invalid. 


Special Registers Altered: 
None 


Store Doubleword Indexed X-form 


stdx RS,RA,RB 


31 RS RA RB 149 / 
0 6 11 16 21 31 


Pre 0 
else « (RA) 
EA €b + (RB) 

MEM(EA, 8) # (RS) 


Let the effective address (EA) be the sum 
(RAJO)+(RB). (RS) is stored into the doubleword in 
storage addressed by EA. 


Special Registers Altered: 
None 


Store Doubleword with Update Indexed 
X-form 


stdux RS,RA,RB 


31 RS RA RB 181 / 
0 6 11 16 21 31 


EA € (RA) + (RB) 
MEM(EA, 8) # (RS) 
RA € EA 


Let the effective address (EA) be the sum (RA)+(RB). 
(RS) is stored into the doubleword in storage 
addressed by EA. 

EA is placed into register RA. 


If RA=0, the instruction form is invalid. 


Special Registers Altered: 
None 
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3.3.4 Fixed-Point Load and Store with Byte Reversal Instructions 


Programming Note 


These instructions have the effect of loading and 
storing data in Little-Endian byte order. 


In some implementations, the Load Byte-Reverse 
instructions may have greater latency than other 
Load instructions. 


Load Halfword Byte-Reverse Indexed 
X-form 


Ihbrx RT,RA,RB 


31 RT RA RB 790 / 
0 6 11 16 21 31 


if RA = 0 then b €0 

else b «€ (RA) 

EA € b * (RB) 

RT € 480 | MEM(EA+1, 1) | MEM(EA, 1) 


Let the effective address (EA) be the sum 
(RAJO)+(RB). Bits 0:7 of the halfword in storage 
addressed by EA are loaded into RTsg:g3. Bits 8:15 of 
the halfword in storage addressed by EA are loaded 
into RT4g.55. RTo:47 are set to 0. 


Special Registers Altered: 
None 


Load Word Byte-Reverse Indexed 
X-form 


Iwbrx RT,RA,RB 
[POWER mnemonic: Ibrx] 


31 RT RA RB 534 / 
0 6 11 16 21 31 


if RA = 0 then b €0 
else b € (RA) 
EA € b + (RB) 
RT 4320 Bede 1) pouce 1) 
MEM (EA+1, 1) | MEM(EA, 1) 
Let the effective address (EA) be the sum 


(RA|0) - (RB). Bits 0:7 of the word in storage 
addressed by EA are loaded into RTsg:g3. Bits 8:15 of 
the word in storage addressed by EA are loaded into 
RT4gss. Bits 16:23 of the word in storage addressed 
by EA are loaded into RT49.47. Bits 24:31 of the word 
in storage addressed by EA are loaded into RT35.39. 
RT 9-31 are set to 0. 


Special Registers Altered: 
None 
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Store Halfword Byte-Reverse Indexed 
X-form 


sthbrx 


31 RS RA RB 918 / 
0 6 11 16 21 31 


if RA = 0 then b €0 

else b € (RA) 

EA € b + (RB) 

MEM(EA, 2) € (RS) 56.63 | (RS) 48:55 


RS,RA,RB 


Let the effective address (EA) be the sum 
(RAJO)+(RB). (RS)s¢.g3 are stored into bits 0:7 of the 
halfword in storage addressed by EA. (RS)4g-55 are 
stored into bits 8:15 of the halfword in storage 
addressed by EA. 


Special Registers Altered: 
None 


Store Word Byte-Reverse Indexed 
X-form 


stwbrx RS,RA,RB 
[POWER mnemonic: stbrx] 


if RA = 0 then b €0 

else b € (RA) 

EA € b + (RB) 

MEM(EA, 4) € (RS)56:63 | (RS)4g55 | (RS) 40:47 | (RS) 32:39 


Let the effective address (EA) be the sum 
(RA|O)+(RB). (RS)gggs are stored into bits 0:7 of the 
word in storage addressed by EA. (RS)4g:55 are stored 
into bits 8:15 of the word in storage addressed by EA. 
(RS)40:47 are stored into bits 16:23 of the word in 
storage addressed by EA. (RS)39.39 are stored into 
bits 24:31 of the word in storage addressed by EA. 


Special Registers Altered: 
None 


Chapter 3. Fixed-Point Processor 43 


IBM Confidential - Feb. 24, 1999 


3.3.5 Fixed-Point Load and Store Multiple Instructions 


The Load/Store Multiple instructions have preferred 
forms; see Section 1.9.1, “Preferred Instruction 
Forms” on page 12. In the preferred forms, storage 
alignment satisfies the following rule. 


m The combination of the EA and RT (RS) is such 
that the low-order byte of GPR 31 is loaded 
(stored) from (into) the last byte of an aligned 
quadword in storage. 


Load Multiple Word D-form 


Imw RT,D(RA) 
[POWER mnemonic: Im] 


46 RT RA D 
0 6 11 16 31 


if RA = 0 then b €0 
else b € (RA) 
EA € b + EXTS(D) 

r €RI 


do while r < 31 
GPR(r) € 920 | MEM(EA, 4) 
rtr+1 
EA €EA + 4 


Let n = (32- RT). Let the effective address (EA) be 
the sum (RA|0)+D. 


n consecutive words starting at EA are loaded into 
the low-order 32 bits of GPRs RT through 31. The 
high-order 32 bits of these GPRs are set to zero. 


EA must be a multiple of 4. |f it is not, either the 
system alignment error handler is invoked or the 
results are boundedly undefined. 


If RA is in the range of registers to be loaded, 
including the case in which RA=0, the instruction 
form is invalid. 


Special Registers Altered: 
None 


Compatibility Note 


For a discussion of POWER compatibility with 
respect to the alignment of the EA for the Load 
Multiple Word and Store Multiple Word 


instructions, see Appendix E, “Incompatibilities 
with the POWER Architecture” on page 163. For 
compatibility with future versions of the PowerPC 
AS Architecture, these EAs should be word- 
aligned. 


Engineering Note 


Causing the system alignment error handler to be 
invoked if attempt is made to execute a Load Mul- 
tiple or Store Multiple instruction having an incor- 
rectly aligned effective address facilitates the 
debugging of software. 


Store Multiple Word D-form 


stmw RS,D(RA) 
[POWER mnemonic: stm] 


47 RS RA D 
0 6 11 16 31 


if RA = 0 then b €0 
else b € (RA) 
EA € b + EXTS(D) 

r «€ RS 


do while r < 31 
reri 
EA €EA + 4 


Let n = (32-RS). Let the effective address (EA) be 
the sum (RA|0)+D. 


n consecutive words starting at EA are stored from 
the low-order 32 bits of GPRs RS through 31. 


EA must be a multiple of 4. If it is not, either the 
system alignment error handler is invoked or the 
results are boundedly undefined. 


Special Registers Altered: 
None 
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3.3.6 Fixed-Point Move Assist Instructions 


The Move Assist instructions allow movement of data 
from storage to registers or from registers to storage 
without concern for alignment. These instructions can 
be used for a short move between arbitrary storage 
locations or to initiate a long move between unaligned 
storage fields. 


The Load/Store String instructions have preferred 
forms; see Section 1.9.1, “Preferred Instruction 
Forms” on page 12. In the preferred forms, register 
usage satisfies the following rules. 


m RS= 4or5 
m RT= 4or5 
m last register loaded/stored < 12 


For some implementations, using GPR 4 for RS and 
RT may result in slightly faster execution than using 
GPR 5; see Book IV, PowerPC AS Implementation 
Features. 


Architecture Note 


The preferred register for RS and RT in PowerPC 
is GPR 5. 
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Load String Word Immediate X-form 


Iswi RT,RA,NB 
[POWER mnemonic: Isi] 


31 RT RA NB 597 / 
0 6 11 16 21 31 


if RA = 0 then EA €0 
else EA € (RA) 
if NB = 0 then n € 32 
else n €NB 
r«*RT-1 
i € 32 
do while n > 0 

if i = 32 then 
r €r +1 (mod 32) 
GPR(r) €0 
GPR I)i:i,7 € MEM (EA, 1) 
i€i-*8 
if i = 64 then i ¢ 32 
EA € EA + 1 
n*n-1 


Let the effective address (EA) be (RAJO). Let n = NB 
if NBz0, n = 32if NB=0; n is the number of bytes to 
load. Let nr = CElL(n- 4); nr is the number of regis- 
ters to receive data. 


n consecutive bytes starting at EA are loaded into 
GPRs RT through RT+nr-1. Data are loaded into the 
low-order four bytes of each GPR; the high-order four 
bytes are set to 0. 


Bytes are loaded left to right in each register. The 
sequence of registers wraps around to GPR O if 
required. If the low-order four bytes of register 
RT+nr—1 are only partially filled, the unfilled low- 
order byte(s) of that register are set to O. 


If RA is in the range of registers to be loaded, 
including the case in which RA=0, the instruction 
form is invalid. 


Special Registers Altered: 
None 


Load String Word Indexed X-form 


Iswx RT,RA,RB 
[POWER mnemonic: Isx] 


31 RT RA RB 533 / 
0 6 11 16 21 31 


if RA = 0 then b €0 
else b € (RA) 
EA € b + (RB) 
n € XER57:53 
r4RT-I 
i € 32 
RT € undefined 
do while n > 0 

if i = 32 then 
rtr+1 (mod 32) 
GPR(r) €0 
GPR P» € MEM (EA, 1) 
i€i-*8 
if i = 64 then i € 32 
EA € EA + 1 
n*n-1 


Let the effective address (EA) be the sum 
(RAJO)+(RB). Let n = XERgzgs n is the number of 
bytes to load. Let nr = CEIL(n= 4); nr is the number 
of registers to receive data. 


If n>0, n consecutive bytes starting at EA are loaded 
into GPRs RT through RT+nr-1. Data are loaded 
into the low-order four bytes of each GPR; the high- 
order four bytes are set to 0. 


Bytes are loaded left to right in each register. The 
sequence of registers wraps around to GPR O if 
required. If the low-order four bytes of register 
RT«nr-1 are only partially filled, the unfilled low- 
order byte(s) of that register are set to O. 


If n=0, the contents of register RT are undefined. 


If RA or RB is in the range of registers to be loaded, 
including the case in which RA=0, either the system 
illegal instruction error handler is invoked or the 
results are boundedly undefined. If RT=RA or 
RT=RB, the instruction form is invalid. 


Special Registers Altered: 
None 
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Store String Word Immediate X-form 


stswi RS,RA,NB 
[POWER mnemonic: stsi] 


31 RS RA NB 725 / 
0 6 11 16 21 31 


if RA = 0 then EA € 0 
else EA € (RA) 
if NB = 0 then n € 32 
else n €NB 

r éR -1 

i € 32 


do while n > 0 

if i = 32 then r € r + 1 (mod 32) 
MEM (EA, 1) € GPR(r);.i,7 
i€i-*8 

if i = 64 then i € 32 

EA €EA + 1 

n*n-1 


Let the effective address (EA) be (RAJO). Let n = NB 
if NBz0, n = 32 if NB=0; n is the number of bytes to 
store. Let nr = CElL(n- 4); nr is the number of regis- 
ters to supply data. 


n consecutive bytes starting at EA are stored from 
GPRs RS through RS+nr-1. Data are stored from 
the low-order four bytes of each GPR. 


Bytes are stored left to right from each register. The 
sequence of registers wraps around to GPR O if 
required. 


Special Registers Altered: 
None 


Store String Word Indexed X-form 


stswx RS,RA,RB 
[POWER mnemonic: stsx] 


31 RS RA RB 661 / 
0 6 11 16 21 31 


if RA = 0 then b €0 
else b € (RA) 
EA € b + (RB) 
n € XERg; 63 
r RS -1 
i € 32 
do while n > 0 
if i = 32 then r € r + 1 (mod 32) 
MEM (EA, 1) € GPR(r);.i,7 
i€i-*8 
if i = 64 then i € 32 
EA € EA + 1 
n*n-1 


Let the effective address (EA) be the sum 
(RAJO)+(RB). Let n = XERzzgs n is the number of 
bytes to store. Let nr = CEIL(n= 4); nr is the number 
of registers to supply data. 


If n>0, n consecutive bytes starting at EA are stored 
from GPRs RS through RS«nr- 1. Data are stored 
from the low-order four bytes of each GPR. 


Bytes are stored left to right from each register. The 
sequence of registers wraps around to GPR O if 
required. 


If n=0, no bytes are stored. 


Special Registers Altered: 
None 
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3.3.7 Other Fixed-Point Instructions 


The remainder of the fixed-point instructions use the 
contents of the General Purpose Registers (GPRs) as 
source operands, and place results into GPRs, into the 
Fixed-Point Exception Register (XER), and into Condi- 
tion Register fields. In addition, the Trap instructions 
compare the contents of one GPR with a second GPR 
or immediate data and, if the specified conditions are 
met, invoke the system trap handler. 


These instructions treat the source operands as 
signed integers unless the instruction is explicitly 
identified as performing an unsigned operation. 


The X-form and XO-form instructions with Rc=1, and 
the D-form instructions addic., andi., and andis., set 
the first three bits of CR Field O to characterize the 


result placed into the target register. In 64-bit mode, 
these bits are set by signed comparison of the result 
to zero. In 32-bit mode, these bits are set by signed 
comparison of the low-order 32 bits of the result to 
zero. 


Unless otherwise noted and when appropriate, when 
CR Field 0 and the XER are set they reflect the value 
placed into the target register. 


Programming Note 


Instructions with the OE bit set or that set CA may 
execute slowly or may prevent the execution of 
subsequent instructions until the instruction has 
completed. 
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3.3.8 Fixed-Point Arithmetic Instructions 


The XO-form Arithmetic instructions with Rc=1, and 
the D-form Arithmetic instruction addic., set the first 
three bits of CR Field O as described in Section 3.3.7, 
“Other Fixed-Point Instructions” on page 48. 


addic, addic., subfic, addc, subfc, adde, subfe, addme, 
subfme, addze, and subfze always set CA, to reflect 
the carry out of bit O in 64-bit mode and out of bit 32 
in 32-bit mode. The XO-form Arithmetic instructions 
set SO and OV when OE=1 to reflect overflow of the 
result. Except for the Multiply Low and Divide 
instructions, the setting of these bits is mode- 
dependent, and reflects overflow of the 64-bit result in 
64-bit mode and overflow of the low-order 32-bit 
result in 32-bit mode. For XO-form Multiply Low and 
Divide instructions, the setting of these bits is mode- 
independent, and reflects overflow of the 64-bit result 
for mulld, divd, and divdu, and overflow of the low- 
order 32-bit result for mullw, divw, and divwu. 


Programming Note 


Notice that CR Field 0 may not reflect the “true” 
(infinitely precise) result if overflow occurs. 


Add Immediate D-form 


addi RT,RA,SI 
[POWER mnemonic: cal] 


14 RT RA SI 
0 6 11 16 31 


if RA = 0 then RT € EXTS (SI) 
else RT € (RA) + EXTS(SI) 


The sum (RAJO) + Sl is placed into register RT. 


Special Registers Altered: 
None 


Extended Mnemonics: 
Examples of extended mnemonics for Add Immediate: 


Extended: Equivalent to: 

li Rx,value addi Rx,0,value 

la Rx,disp(Ry) addi Rx,Ry,disp 
subi Rx,Ry,value addi  Rx,Ry,- value 


Programming Note 


addi, addis, add, and subf are the preferred 
instructions for addition and subtraction, because 
they set few status bits. 


Notice that addi and addis use the value 0, not the 
contents of GPR 0, if RA=0. 


Extended mnemonics for addition and 
subtraction 


Several extended mnemonics are provided that use 
the Add Immediate and Add Immediate Shifted 
instructions to load an immediate value or an address 
into a target register. Some of these are shown as 
examples with the two instructions. 


The PowerPC AS Architecture supplies Subtract From 
instructions, which subtract the second operand from 
the third. A set of extended mnemonics is provided 
that use the more “normal” order, in which the third 
operand is subtracted from the second, with the third 
operand being either an immediate field or a register. 
Some of these are shown as examples with the appro- 
priate Add and Subtract From instructions. 


See Appendix B, “Assembler Extended Mnemonics” 
on page 143 for additional extended mnemonics. 


Add Immediate Shifted D-form 


addis RT,RA,SI 
[POWER mnemonic: cau] 


15 RT RA SI 
0 6 11 16 31 


if RA = 0 then RT € EXTS(SI | 160) 
else RT € (RA) + EXTS(SI | 180) 


The sum (RAIO) + (SI | 0x0000) is placed into register 
RT. 


Special Registers Altered: 
None 
Extended Mnemonics: 
Examples of extended mnemonics for Add Immediate 
Shifted: 
Extended: 


lis Rx,value 
subis | Rx, Ry,value 


Equivalent to: 


addis Rx,0,value 
addis Rx,Ry,- value 
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Add XO-form 

add RT,RA,RB (OE=0 Rc=0) 
add. RT,RA,RB (OE=0 Rc-1) 
addo RT,RA,RB (OE=1 Rc=0) 
addo. RT,RA,RB (OE=1 Rc-1) 


[POWER mnemonics: cax, cax., caxo, caxo.] 


31 RT RA RB |OE 266 Re 
0 6 11 16 21 |22 31 
RT € (RA) + (RB) 
The sum (RA) + (RB) is placed into register RT. 
Special Registers Altered: 


CRO (if Re=1) 
SO OV (if OE=1) 


Add Immediate Carrying D-form 


addic RT,RA,SI 
[POWER mnemonic: ai] 


12 RT RA SI 
0 6 11 16 31 


RT € (RA) + EXTS(SI) 
The sum (RA) + Sl is placed into register RT. 
Special Registers Altered: 
CA 
Extended Mnemonics: 
Example of extended mnemonics for Add Immediate 


Carrying: 


Extended: 
subic Rx,Ry,value 


Equivalent to: 
addic  Rx,Ry,- value 
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Subtract From XO-form 


subf RT,RA,RB (OE=0 Rc=0) 
subf. RT,RA,RB (OE=0 Re=1) 
subfo RT,RA,RB (OE=1 Rc20) 
subfo. RT,RA,RB (OE=1 Re=1) 


31 RT RA RB |OE 40 Rc 
0 6 11 16 21 |22 31 
RT € ^(RA) * (RB) * 1 
The sum =(RA) + (RB) +1 is placed into register 


RT. 


Special Registers Altered: 
CRO (if Rc=1) 
SO OV (if OE=1) 
Extended Mnemonics: 
Example of extended mnemonics for Subtract From: 


Extended: Equivalent to: 
sub Rx, Ry,Rz subf Rx,Rz,Ry 


Add Immediate Carrying and Record 
D-form 


addic. RT,RA,SI 
[POWER mnemonic: ai.] 


RT € (RA) + EXTS(SI) 
The sum (RA) + Sl is placed into register RT. 
Special Registers Altered: 
CRO CA 
Extended Mnemonics: 
Example of extended mnemonics for Add Immediate 


Carrying and Record: 


Extended: 
subic. Rx,Ry,value 


Equivalent to: 
addic. Rx,Ry,- value 
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Subtract From Immediate Carrying 
D-form 


subfic RT,RA,SI 
[POWER mnemonic: sfi] 


RT € (RA) + EXTS(SI) + 1 
The sum =(RA) + Sl+ 1 is placed into register RT. 


Special Registers Altered: 
CA 


Add Carrying XO-form 


addc RT,RA,RB (OE=0 Rc=0) 
addc. RT,RA,RB (OE=0 Rc-1) 
addco RT,RA,RB (OE=1 Rc-0) 
addco. RT,RA,RB (OE=1 Rc=1) 


[POWER mnemonics: a, a., ao, ao.] 


31 RT RA RB |OE 10 Rc 
0 6 11 16 21 |22 31 
RT € (RA) + (RB) 
The sum (RA) + (RB) is placed into register RT. 


Special Registers Altered: 


CA 
CRO (if Ro=1) 
SO OV (if OE=1) 


Subtract From Carrying XO-form 


subfc RT,RA,RB (OE=0 Rc20) 
subfc. RT,RA,RB (OE=0 Re=1) 
subfco RT,RA,RB (OE=1 Rc20) 
subfco. RT,RA,RB (OE=1 Re=1) 


[POWER mnemonics: sf, sf., sfo, sfo.] 


31 RT RA RB |OE 8 Rc 
0 6 11 16 21 |22 31 
RT € 7(RA) + (RB) + 1 


The sum =(RA) + (RB) + 1 is placed into register 
RT. 


Special Registers Altered: 


CA 
CRO (if Re=1) 
SO OV (if OE=1) 


Extended Mnemonics: 
Example of extended mnemonics for Subtract From 
Carrying: 


Extended: 
subc  Rx,Ry,Rz 


Equivalent to: 
subfc Rx,Rz,Ry 
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Add Extended XO-form 


adde RT,RA,RB (OE=0 Rc=0) 
adde. RT,RA,RB (OE=0 Rc-1) 
addeo RT,RA,RB (OE=1 Rc=0) 
addeo. RT,RA,RB (OE=1 Rc=1) 


[POWER mnemonics: ae, ae., aeo, aeo.] 


31 RT RA RB |OE 138 Rc 
0 6 11 16 21 |22 31 
RT € (RA) + (RB) + CA 


The sum (RA) + (RB) + CA is placed into register 
RT. 


Special Registers Altered: 


CA 
CRO (if Re=1) 
SO OV (if OE=1) 


Add to Minus One Extended XO-form 
addme RT,RA 
addme. RT,RA 


( ) 

( ) 

addmeo RT,RA (OE=1 Rc=0) 

addmeo. RIT,RA ( ) 
o 


31 RT RA 111. [OE 234 Re 
0 6 11 16 21 |22 31 


RT € (RA) + CA — 1 
The sum (RA) + CA + 8% is placed into register RT. 


Special Registers Altered: 


CA 
CRO (if Re=1) 
SO OV (if OE=1) 
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Subtract From Extended XO-form 


subfe RT,RA,RB (OE=0 Rc=0) 
subfe. RT,RA,RB (OE=0 Re=1) 
subfeo RT,RA,RB (OE=1 Rc20) 
subfeo. RT,RA,RB (OE=1 Re=1) 


[POWER mnemonics: sfe, sfe., sfeo, sfeo.] 


31 RT RA RB |OE 136 Re 
0 6 11 16 21 | 22 31 
RT € 7(RA) + (RB) + CA 


The sum (RA) + (RB) + CA is placed into register 
RT. 


Special Registers Altered: 


CA 
CRO (if Re=1) 
SO OV (if OE=1) 


Subtract From Minus One Extended 
XO-form 


subfme RT,RA (OE=0 Rc=0) 
subfme. RT,RA (OE=0 Rc=1) 
subfmeo | RT,RA (OE=1 Rc=0) 
subfmeo. RT,RA (OE=1 Rc=1) 


[POWER mnemonics: sfme, sfme., sfmeo, sfmeo.] 


31 RT RA 111 [OE 232 Re 
0 6 11 16 21 |22 31 


RT € (RA) + CA — 1 
The sum =(RA) + CA + HM is placed into register 
RT. 


Special Registers Altered: 
CA 
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CRO 


(if Ro=1) 
(if OE=1) 
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Add to Zero Extended XO-form 


addze RT,RA (OE=0 Rc=0) 
addze. RT,RA (OE=0 Rc=1) 
addzeo RT,RA (OE=1 Rc=0) 
addzeo. RT,RA (OE=1 Rc=1) 


[POWER mnemonics: aze, aze., azeo, azeo.] 


31 RT RA //| [OE 202 Re 
0 6 11 16 21 |22 31 


RT € (RA) + CA 
The sum (RA) + CA is placed into register RT. 


Special Registers Altered: 


CA 
CRO (if Re=1) 
SO OV (if OE=1) 


Subtract From Zero Extended XO-form 


subfze RT,RA (OE=0 Rc=0) 
subfze. RT,RA (OE=0 Rc=1) 
subfzeo RT,RA (OE=1 Rc=0) 
subfzeo. RT,RA (OE=1 Rc=1) 


[POWER mnemonics: sfze, sfze., sfzeo, sfzeo.] 


31 RT RA Hl |OE 200 [Re 
0 6 11 16 21 |22 31 


RT € 7(RA) + CA 
The sum =(RA) + CA is placed into register RT. 


Special Registers Altered: 


CA 
CRO (if Re=1) 
SO OV (if OE=1) 


Programming Note 


The setting of CA by the Add and Subtract From 
instructions, including the Extended versions 
thereof, is mode-dependent. If a sequence of 
these instructions is used to perform extended- 
precision addition or subtraction, the same mode 
should be used throughout the sequence. 


Negate XO-form 


neg RT,RA (OE=0 Rc=0) 
neg. RT,RA (OE=0 Rc=1) 
nego RT,RA (OE=1 Rc=0) 
nego. RT,RA (OE=1 Rc=1) 


RT € 7(RA) + 1 
The sum =(RA) + 1 is placed into register RT. 


If the processor is in 64-bit mode and register RA 
contains the most negative 64-bit number (0x8000 | 
0000 0000 0000), the result is the most negative 
number and, if OE=1, OV is set to 1. Similarly, if the 
processor is in 32-bit mode and (RA)g39.¢3 contain the 
most negative 32-bit number (0x8000 0000), the low- 
order 32 bits of the result contain the most negative 
32-bit number and, if OE=1, OV is set to 1. 


Special Registers Altered: 


CRO (if Re=1) 
SO OV (if OE=1) 
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Multiply Low Immediate D-form 


mulli RT,RA,SI 
[POWER mnemonic: muli] 


prodo:127 + (RA) x EXTS (SI) 

RT € prodg4:127 

The 64-bit first operand is (RA). The 64-bit second 
operand is the sign-extended value of the SI field. 
The low-order 64 bits of the 128-bit product of the 
operands are placed into register RT. 


Both operands and the product are interpreted as 
signed integers. 


Special Registers Altered: 
None 


Multiply Low Doubleword XO-form 


mulld RT,RA,RB (OE=0 Rc=0) 
mulld. RT,RA,RB (OE=0 Rc-1) 
mulldo RT,RA,RB (OE=1 Rc=0) 
mulldo.  RT,RA,RB (OE=1 Rc-1) 

31 RT RA RB |oE| 233 [Re 
0 6 11 16 21 |22 31 


prodo:197 € (RA) x (RB) 

RT € prodg4427 

The 64-bit operands are (RA) and (RB). The low-order 
64 bits of the 128-bit product of the operands are 
placed into register RT. 


If OE=1 then OV is set to 1 if the product cannot be 
represented in 64 bits. 


Both operands and the product are interpreted as 
signed integers. 


Special Registers Altered: 
CRO (if Rc=1) 
(if OE=1) 


SO OV 


Programming Note 


The XO-form Multiply instructions may execute 
faster on some implementations if RB contains 
the operand having the smaller absolute value. 
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Multiply Low Word XO-form 


mullw RT,RA,RB (OE=0 Rc=0) 
mullw. RT,RA,RB (OE=0 Re=1) 
mullwo RT,RA,RB (OE=1 Rc20) 
mullwo. RT,RA,RB (OE=1 Re=1) 


[POWER mnemonics: muls, muls., mulso, mulso.] 


31 RT RA RB |OE| 235 
0 6 11 16 21 | 22 31 
RT € (RA) 32:63 * (RB) 32:63 


The 32-bit operands are the low-order 32 bits of RA 
and of RB. The 64-bit product of the operands is 
placed into register RT. 


If OE=1 then OV is set to 1 if the product cannot be 
represented in 32 bits. 


Both operands and the product are interpreted as 
signed integers. 


Special Registers Altered: 
(if Rc=1) 


SO OV (if OE-1) 


Programming Note 


For mulli and mullw, the low-order 32 bits of the 
product are the correct 32-bit product for 32-bit 
mode. 


For mulli and mulld, the low-order 64 bits of the 
product are independent of whether the operands 
are regarded as signed or unsigned 64-bit inte- 
gers. For mulli and mullw, the low-order 32 bits 
of the product are independent of whether the 
operands are regarded as signed or unsigned 
32-bit integers. 
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Multiply High Doubleword XO-form 


mulhd RT,RA,RB (Rc=0) 
mulhd. RT,RA,RB = 


31 RT RA RB / 73 Re 
0 6 11 16 21 |22 31 


prodo:197 ¢ (RA) x (RB) 
RT ¢ prodo:63 


The 64-bit operands are (RA) and (RB). The high- 
order 64 bits of the 128-bit product of the operands 
are placed into register RT. 


Both operands and the product are interpreted as 
signed integers. 


Special Registers Altered: 
CRO (if Rc=1) 


Multiply High Doubleword Unsigned 
XO-form 


mulhdu 
mulhdu. 


31 RT RA RB / 9 Re 
0 6 11 16 21 | 22 31 


prodo:127 € (RA) x (RB) 
RT ¢ prodg.¢3 


The 64-bit operands are (RA) and (RB). The high- 
order 64 bits of the 128-bit product of the operands 
are placed into register RT. 


RT,RA,RB 
RT,RA,RB 


Both operands and the product are interpreted as 
unsigned integers, except that if Rc=1 the first three 
bits of CR Field 0 are set by signed comparison of the 
result to zero. 


Special Registers Altered: 
CRO (if Rc=1) 


Multiply High Word XO-form 


mulhw 
mulhw. 


31 RT RA RB / 75 Re 
0 6 11 16 21 |22 31 


prodo:s3 € (RA) 32:63 * (RB) 32:63 

RT 39.63 € prodo:31 

RTo:31 « undefined 

The 32-bit operands are the low-order 32 bits of RA 
and of RB. The high-order 32 bits of the 64-bit 
product of the operands are placed into RT39.¢3. The 
contents of RTo.3; are undefined. 


RT,RA,RB (Rc=0) 
RT,RA,RB (Rc=1) 


Both operands and the product are interpreted as 
signed integers. 


Special Registers Altered: 
CRO (bits 0:2 undefined in 64-bit mode) (if Rc=1) 


Multiply High Word Unsigned XO-form 


mulhwu RT,RA,RB (Rcz0) 
mulhwu. RT,RA,RB (Rc=1) 

31 RT RA RB / 11 Re 
0 6 11 16 21 |22 81 


prodo:s3 € (RA) 32:63 * (RB) 32:63 

RTg2:53 € prodo:31 

RTo:31 € undefined 

The 32-bit operands are the low-order 32 bits of RA 
and of RB. The high-order 32 bits of the 64-bit 
product of the operands are placed into RTzo.g3. The 
contents of RT¿.3, are undefined. 


Both operands and the product are interpreted as 
unsigned integers, except that if Rc=1 the first three 
bits of CR Field O are set by signed comparison of the 
result to zero. 


Special Registers Altered: 
CRO (bits 0:2 undefined in 64-bit mode) (if Rc=1) 
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Divide Doubleword XO-form 


divd RT,RA,RB (OE=0 Rc=0) 
divd. RT,RA,RB (OE=0 Re=1) 
divdo RT,RA,RB (OE=1 Rc20) 
divdo. RT,RA,RB (OE=1 Re=1) 


31 RT RA RB |OE 489 Rc 
0 6 11 16 21 |22 31 


dividendos ¢ (RA) 
divisorg.¢3 ¢ (RB) 
RI € dividend = divisor 


The 64-bit dividend is (RA). The 64-bit divisor is (RB). 
The 64-bit quotient of the dividend and divisor is 
placed into register RT. The remainder is not sup- 
plied as a result. 


Both operands and the quotient are interpreted as 
signed integers. The quotient is the unique signed 
integer that satisfies 


dividend = (quotient x divisor) + r 


where Ox r< |divisor| if the dividend is nonnegative, 
and — |divisor| < r< O if the dividend is negative. 


If an attempt is made to perform any of the divisions 


0x8000 0000 0000 0000 + —1 
<anything> + 0 


then the contents of register RT are undefined as are 
(if Rc=1) the contents of the LT, GT, and EQ bits of 
CR Field 0. In these cases, if OE=1 then OV is set to 
1. 


Special Registers Altered: 
CRO (if Rc=1) 
SO OV (if OE=1) 


Programming Note 


The 64-bit signed remainder of dividing (RA) by 
(RB) can be computed as follows, except in the 
case that (RA) = -263 and (RB) = —1. 


divd | RT,RA,RB # RT = quotient 
mulld  RT,RT,RB # RT = quotient*divisor 
subf  RT,RT,RA # RT = remainder 


Divide Word XO-form 


divw RT,RA,RB (OE=0 Rc=0) 
divw. RT,RA,RB (OE=0 Re=1) 
divwo RT,RA,RB (OE=1 Rc20) 
divwo. RT,RA,RB (OE=1 Re=1) 


31 RT RA RB |OE 491 Rc 
0 6 11 16 21 |22 31 


dividendos € EXTS ( (RA) 32:63) 
divisorg.¢3 € EXTS ( (RB) 32:63) 
RTo:31 4 undefined 


The 64-bit dividend is the sign-extended value of 
(RA)s263: The 64-bit divisor is the sign-extended 
value of (RB)30.63. The 64-bit quotient is formed. The 
low-order 32 bits of the 64-bit quotient are placed into 
RT32:63- The contents of RTo.3; are undefined. The 
remainder is not supplied as a result. 


Both operands and the quotient are interpreted as 
signed integers. The quotient is the unique signed 
integer that satisfies 


dividend = (quotient x divisor) + r 


where 0< r< |divisor| if the dividend is nonnegative, 
and — |divisor| < r< O if the dividend is negative. 


If an attempt is made to perform any of the divisions 


0x8000 0000 + —1 
<anything> + 0 


then the contents of register RT are undefined as are 
(if Rc=1) the contents of the LT, GT, and EQ bits of 
CR Field 0. In these cases, if OE=1 then OV is set to 
1. 


Special Registers Altered: 
CRO (bits 0:2 undefined in 64-bit mode) (if Rc=1) 
SO OV (if OE=1) 


Programming Note 


The 32-bit signed remainder of dividing (RA)s2:63 
by (RB)s2.63 can be computed as follows, except in 
the case that (RA)39.63 = — 2% and (RB)39.63= — 1. 


divw  RT,RA,RB # RT = quotient 
mullw  RT,RT,RB # RT = quotient*divisor 
subf  RT,RT,RA # RT = remainder 
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Divide Doubleword Unsigned XO-form 


divdu RT,RA,RB (OE=0 Rc=0) 
divdu. RT,RA,RB (OE=0 Re=1) 
divduo RT,RA,RB (OE=1 Rc20) 
divduo. RT,RA,RB (OE=1 Re=1) 


31 RT RA RB |OE 457 Re 
0 6 11 16 21 |22 31 


dividendg.g3 ¢ (RA) 

divisorg.¢3 ¢ (RB) 

RT € dividend + divisor 

The 64-bit dividend is (RA). The 64-bit divisor is (RB). 
The 64-bit quotient of the dividend and divisor is 
placed into register RT. The remainder is not sup- 
plied as a result. 


Both operands and the quotient are interpreted as 
unsigned integers, except that if Rc=1 the first three 
bits of CR Field 0 are set by signed comparison of the 
result to zero. The quotient is the unique unsigned 
integer that satisfies 


dividend = (quotient x divisor) + r 


where 0< r< divisor. 


If an attempt is made to perform the division 
«anything» + 0 


then the contents of register RT are undefined as are 
(if Rc21) the contents of the LT, GT, and EQ bits of 
CR Field 0. In this case, if OE=1 then OV is set to 1. 


Special Registers Altered: 
CRO (if Rc=1) 
SO OV (if OE=1) 


Programming Note 


The 64-bit unsigned remainder of dividing (RA) by 
(RB) can be computed as follows. 


divdu RT,RA,RB # RT = quotient 
mulld  RT,RT,RB # RT = quotient*divisor 
subf RT,RT,RA # RT = remainder 


Divide Word Unsigned XO-form 


divwu RT,RA,RB (OE=0 Rc=0) 
divwu. RT,RA,RB (OE=0 Re=1) 
divwuo RT,RA,RB (OE=1 Rc20) 
divwuo. RT,RA,RB (OE=1 Re=1) 


31 RT RA RB |OE 459 Rc 
0 6 11 16 21 |22 31 


dividendosa € 220 | (RA) 30:63 
divisorg.¢3 € 320 (RB) 39-63 
RT32:63 € dividend + divisor 
RT 9-34 + undefined 


The 64-bit dividend is the zero-extended value of 
(RA)3263: The 64-bit divisor is the zero-extended 
value of (RB)3>2:63. The 64-bit quotient is formed. The 
low-order 32 bits of the 64-bit quotient are placed into 
RT39.63. The contents of RT¿.34 are undefined. The 
remainder is not supplied as a result. 


Both operands and the quotient are interpreted as 
unsigned integers, except that if Rc=1 the first three 
bits of CR Field O are set by signed comparison of the 
result to zero. The quotient is the unique unsigned 
integer that satisfies 


dividend= (quotientx divisor) + r 


where Ox r « divisor. 


If an attempt is made to perform the division 
«anything» + 0 


then the contents of register RT are undefined as are 
(if Rc21) the contents of the LT, GT, and EQ bits of 
CR Field 0. In this case, if OE=1 then OV is set to 1. 


Special Registers Altered: 
CRO (bits 0:2 undefined in 64-bit mode) (if Rc=1) 
SO OV (if OE=1) 


Programming Note 


The 32-bit unsigned remainder of dividing 
(RA)39.63 by (RB)35.63 can be computed as follows. 


divwu RT,RA,RB # RT = quotient 
mullw  RT,RT,RB # RT = quotient*divisor 
subf RT,RT,RA # RT = remainder 
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3.3.9 Fixed-Point Compare Instructions 


The fixed-point Compare instructions compare the 
contents of register RA with (1) the sign-extended 
value of the SI field, (2) the zero-extended value of 
the UI field, or (3) the contents of register RB. The 
comparison is signed for cmpi and cmp, and unsigned 
for cmpli and cmpl. 


The L field controls whether the operands are treated 
as 64-bit or 32-bit quantities, as follows: 


L Operand length 
0 32-bit operands 
1 64-bit operands 


When the operands are treated as 32-bit signed quan- 
tities, bit 32 of the register (RA or RB) is the sign bit. 


The Compare instructions set one bit in the leftmost 
three bits of the designated CR field to 1, and the 
other two to 0. XERso is copied to bit 3 of the desig- 
nated CR field. 


Compare Immediate D-form 


cmpi BF,L,RA,SI 


0 6 9nd 11 16 31 


if L= 0 then a € EXTS ( (RA) 32:63) 


else a € (RA) 
if a < EXTS(SI) then c € 0b100 
else if a > EXTS(SI) then c € 0b010 
else c € 0b001 


CR4xBF:4xBF+3 €C | XERso 


The contents of register RA ((RA)32.63 sign-extended 
to 64 bits if L=0) are compared with the sign- 
extended value of the SI field, treating the operands 
as signed integers. The result of the comparison is 
placed into CR field BF. 


Special Registers Altered: 
CR field BF 
Extended Mnemonics: 


Examples of extended mnemonics for Compare Imme- 
diate: 


Extended: Equivalent to: 
cmpdi Rx,value cmpi 0,1,Rx,value 
cmpwi cr3,Rx,value cmpi 3,0,Rx,value 


The CR field is set as follows. 

Bit Name Description 

0 LT (RA) < Sl or (RB) (signed comparison) 
(RA) € Ul or (RB) (unsigned comparison) 

1 GT (RA) » Sl or (RB) (signed comparison) 
(RA) > Ul or (RB) (unsigned comparison) 
(RA) = SI, UI, or (RB) 

Summary Overflow from the XER 


EQ 
3 SO 


Extended mnemonics for compares 


A set of extended mnemonics is provided so that 
compares can be coded with the operand length as 
part of the mnemonic rather than as a numeric 
operand. Some of these are shown as examples with 
the Compare instructions. See Appendix B, “Assem- 
bler Extended Mnemonics" on page 143 for additional 
extended mnemonics. 


Compare X-form 


cmp BF,L,RA,RB 
31 


| 
0 6 9nd 11 16 21 31 


if L= 0 then a € EXTS ( (RA) 30:63) 
b « EXTS ((RB) 39.63) 


else a € (RA) 

b € (RB) 
if a < b then c € 0b100 
else if a > b then c € 0b010 
else c € 06001 


CR4xBF:4xBF+3 * C | XERgo 


The contents of register RA ((RA)s5e63 if L=0) are 
compared with the contents of register RB ((RB)2.gs if 
L=0), treating the operands as signed integers. The 
result of the comparison is placed into CR field BF. 


Special Registers Altered: 
CR field BF 


Extended Mnemonics: 
Examples of extended mnemonics for Compare: 


Extended: Equivalent to: 
cmpd Rx,Ry cmp 0,1, Rx Ry 
cmpw cr3,Rx,Ry cmp — 3,0, Rx, Ry 
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Compare Logical Immediate D-form 


cmpli BF,L,RA,UI 


10 BF |/|L| RA UI 
0 6 [ep 16 31 


if L= 0 then a € 320 | (RA) 39:63 

else a € (RA) 
if a è (480 | UI) then c € 0b100 
else if a > (480 | UI) then c € 0b010 
else c € 00001 
CR4xBF:4xBF+3 * C | XERgo 
The contents of register RA ((RA)s2.63 zero-extended 
to 64 bits if L=0) are compared with “80 | UI, treating 
the operands as unsigned integers. The result of the 
comparison is placed into CR field BF. 


Special Registers Altered: 
CR field BF 
Extended Mnemonics: 


Examples of extended mnemonics for Compare 
Logical Immediate: 


Extended: Equivalent to: 
cmpldi Rx,value cmpli 0,1,Rx,value 
cmplwi cr3,Rx,value cmpli 3,0,Rx,value 


Compare Logical X-form 


cmpl BF,L,RA,RB 


31 BF |/|L| RA RB 32 / 
0 6 [egt 16 21 31 


if L= 0 then a € 320 | (RA) 30:63 
b € 9*0 | (RB) 30.63 
else a € (RA) 
b € (RB) 
if a < b then c € 0b100 
else if a > b then c € 0b010 
else c € 0b001 


CR4xBF:4xBF+3 * € | XERgo 


The contents of register RA ((RA)s263 if L=0) are 
compared with the contents of register RB ((RB)39.g3 if 
L=0), treating the operands as unsigned integers. 
The result of the comparison is placed into CR field 
BF. 


Special Registers Altered: 
CR field BF 
Extended Mnemonics: 


Examples of extended mnemonics for Compare 
Logical: 


Extended: Equivalent to: 
cmpld | Rx; Ry cmpl 0,1,Rx,Ry 
cmplw cr3,Rx,Ry cmpl 3,0,Rx,Ry 
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3.3.10 Fixed-Point Trap Instructions 


The Trap instructions are provided to test for a speci- 
fied set of conditions. If any of the conditions tested 
by a Trap instruction are met, the system trap handler 
is invoked. If none of the tested conditions are met, 
instruction execution continues normally. 


The contents of register RA are compared with either 
the sign-extended value of the SI field or the contents 
of register RB, depending on the Trap instruction. For 
tdi and td, the entire contents of RA (and RB) partic- 
ipate in the comparison; for twi and tw, only the con- 
tents of the low-order 32 bits of RA (and RB) 
participate in the comparison. 


This comparison results in five conditions which are 


ANDed with TO. If the result is not O the system trap 
handler is invoked. These conditions are as follows. 


Trap Doubleword Immediate D-form 


tdi TO,RA,SI 
2 TO RA SI 
0 6 11 16 31 
a € (RA) 
if (a < EXTS(SI)) & TOg then TRAP 
if (a > EXTS(SI)) & TO, then TRAP 
if (a = EXTS(SI)) & TO, then TRAP 
if (a è EXTS(SI)) & TOg then TRAP 
if (a > EXTS(SI)) & T04 then TRAP 


The contents of register RA are compared with the 
sign-extended value of the SI field. If any bit in the 
TO field is set to 1 and its corresponding condition is 
met by the result of the comparison, the system trap 
handler is invoked. 


Special Registers Altered: 
None 
Extended Mnemonics: 


Examples of extended 
Doubleword Immediate: 


mnemonics for Trap 


Extended: Equivalent to: 
tdlti Rx,value tdi 16,Rx,value 
tdnei Rx,value tdi 24,Rx,value 


TO Bit ANDed with Condition 

0 Less Than, using signed comparison 

1 Greater Than, using signed comparison 

2 Equal 

3 Less Than, using unsigned comparison 

4 Greater Than, using unsigned comparison 


Extended mnemonics for traps 


A set of extended mnemonics is provided so that 
traps can be coded with the condition as part of the 
mnemonic rather than as a numeric operand. Some 
of these are shown as examples with the Trap 
instructions. See Appendix B, “Assembler Extended 
Mnemonics” on page 143 for additional extended 
mnemonics. 


Trap Word Immediate D-form 


twi TO,RA,SI 
[POWER mnemonic: ti] 


3 TO RA Sl 
0 6 11 16 31 


2 
if (a < EXTS (ST) & T09 then TRAP 
if (a > EXTS(SI)) & TO; then TRAP 
if (a = EXTS(SI)) & TO> then TRAP 
if (a € EXTS(SI)) & TOS then TRAP 
if (a ? EXTS(SI)) & TO4 then TRAP 


The contents of RAgo.g3 are compared with the sign- 
extended value of the SI field. If any bit in the TO 
field is set to 1 and its corresponding condition is met 
by the result of the comparison, the system trap 
handler is invoked. 


Special Registers Altered: 
None 
Extended Mnemonics: 


Examples of extended mnemonics for Trap Word 
Immediate: 


Extended: Equivalent to: 
twgti — Rx,value twi 8,Rx,value 
twllei  Rx,value twi 6,Rx,value 
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Trap Doubleword X-form 


td TO,RA,RB 
31 TO RA RB 68 / 
0 6 11 16 21 31 
a € (RA) 
b € (RB) 
if (a < b) & TOS then TRAP 
if (a > b) € TO, then TRAP 
if (a = b) & TO, then TRAP 
if (a € b) & TOz then TRAP 
if (a > b) & T04 then TRAP 


The contents of register RA are compared with the 
contents of register RB. If any bit in the TO field is 
set to 1 and its corresponding condition is met by the 
result of the comparison, the system trap handler is 
invoked. 


Special Registers Altered: 
None 


Extended Mnemonics: 


Examples of extended mnemonics for Trap 
Doublewora: 

Extended: Equivalent to: 

tdge Rx,Ry td 12,Rx,Ry 

tdini Rx,Ry td 5,Rx, Ry 


Trap Word X-form 


tw TO,RA,RB 
[POWER mnemonic: t] 


31 TO RA RB 4 / 
0 6 11 16 21 31 


b € EXTS ( (RB) 32.63) 

if (a < b) & TOg then TRAP 
if (a > b) & TO, then TRAP 
if (a = b) & TO» then TRAP 
if (a € b) € TO4 then TRAP 
if (a > b) & TO4 then TRAP 


The contents of RAgo.g3 are compared with the con- 
tents of RBs5:63. If any bit in the TO field is set to 1 
and its corresponding condition is met by the result of 
the comparison, the system trap handler is invoked. 


Special Registers Altered: 
None 


Extended Mnemonics: 


Examples of extended mnemonics for Trap Word: 


Extended: Equivalent to: 
tweq  Rx,Ry tw 4,Rx,Ry 
twlge Rx,Ry tw 5,Rx,Ry 
trap tw 31,0,0 
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3.3.11 Fixed-Point Logical Instructions 


The Logical instructions perform bit-parallel oper- 
ations on 64-bit operands. 


The X-form Logical instructions with Rc=1, and the 
D-form Logical instructions andi. and andis., set the 
first three bits of CR Field O as described in Section 
3.3.7, “Other Fixed-Point Instructions” on page 48. 
The Logical instructions do not change the SO, OV, 
and CA bits in the XER. 


AND Immediate D-form 


andi. RA,RS,UI 
[POWER mnemonic: andil.] 


28 RS RA UI 
0 6 11 16 31 


RA € (RS) & (480 | UI) 
The contents of register RS are ANDed with “80 | UI 
and the result is placed into register RA. 


Special Registers Altered: 
CRO 
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Extended mnemonics for logical 
operations 


An extended mnemonic is provided that generates the 
preferred form of “no-op” (an instruction that does 
nothing). This is shown as an example with the OR 
Immediate instruction. 


Extended mnemonics are provided that use the OR 
and NOR instructions to copy the contents of one reg- 
ister to another, with and without complementing. 
These are shown as examples with the two 
instructions. 


See Appendix B, "Assembler Extended Mnemonics" 
on page 143 for additional extended mnemonics. 


AND Immediate Shifted D-form 


andis. RA,RS,UI 
[POWER mnemonic: andiu.] 


29 RS RA UI 
0 6 11 16 31 


RA € (RS) & (320 | ur | 180) 
The contents of register RS are ANDed with 20 | UI | 
160 and the result is placed into register RA. 


Special Registers Altered: 
CRO 
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OR Immediate D-form 


ori RA,RS,UI 
[POWER mnemonic: oril] 


24 RS RA UI 
0 6 11 16 31 


RA € (RS) | (480 | UI) 
The contents of register RS are ORed with 480 | UI 
and the result is placed into register RA. 
The preferred “no-op” (an instruction that does 
nothing) is: 
ori 0,0,0 
Special Registers Altered: 
None 
Extended Mnemonics: 
Example of extended mnemonics for OR Immediate: 


Extended: Equivalent to: 
nop ori 0,0,0 


Engineering Note 


It is desirable for implementations to make the 
preferred form of no-op execute quickly, since this 
form should be used by compilers. 


XOR Immediate D-form 


xori RA,RS,UI 
[POWER mnemonic: xoril] 


26 RS RA Ul 
0 6 11 16 31 


RA € (RS) e (480 | UL) 
The contents of register RS are XORed with 480 | Ul 
and the result is placed into register RA. 


Special Registers Altered: 
None 


OR Immediate Shifted D-form 


oris RA,RS,UI 
[POWER mnemonic: oriu] 


25 RS RA UI 
0 6 11 16 31 


RA € (RS) | (820 | ur | 180) 


The contents of register RS are ORed with °20 | UI | 
160 and the result is placed into register RA. 


Special Registers Altered: 
None 


XOR Immediate Shifted D-form 


xoris RA,RS,UI 
[POWER mnemonic: xoriu] 


27 RS RA UI 
0 6 11 16 31 


RA € (RS) e (20 | ur | 180) 


The contents of register RS are XORed with 320 | UI | 
169 and the result is placed into register RA. 


Special Registers Altered: 
None 
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AND X-form 
and RA,RS,RB (Rc=0) 
and. RA,RS,RB (Rc=1) 


31 RS RA RB 28 Re 
0 6 11 16 21 31 


RA € (RS) & (RB) 


The contents of register RS are ANDed with the con- 
tents of register RB and the result is placed into reg- 


ister RA. 
Special Registers Altered: 

CRO (if Re=1) 
XOR X-form 
xor RA,RS,RB (Rc=0) 
xor. RA,RS,RB (Rc=1) 


31 RS RA RB 316 Re 
0 6 11 16 21 31 


RA € (RS) O (RB) 


The contents of register RS are XORed with the con- 
tents of register RB and the result is placed into reg- 


ister RA. 


Special Registers Altered: 
CRO (if Rc=1) 
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OR X-form 
or RA,RS,RB (Rc=0) 
or. RA,RS,RB (Rc=1) 


31 RS RA RB 444 Re 
0 6 11 16 21 31 


RA € (RS) | (RB) 


The contents of register RS are ORed with the con- 
tents of register RB and the result is placed into reg- 
ister RA. 


Special Registers Altered: 
CRO (if Rc=1) 


Extended Mnemonics: 
Example of extended mnemonics for OR: 


Extended: Equivalent to: 

mr Rx,Ry or Rx, Ry, Ry 
NAND X-form 
nand RA,RS,RB (Rc=0) 
nand. RA,RS,RB (Rc=1) 


31 RS RA RB 476 Re 
0 6 11 16 21 31 


RA € 7((RS) € (RB)) 


The contents of register RS are ANDed with the con- 
tents of register RB and the complemented result is 
placed into register RA. 


Special Registers Altered: 
CRO (if Rc=1) 


Programming Note 


nand or nor with RS=RB can be used to obtain 
the one's complement. 
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NOR X-form 
nor RA,RS,RB (Rc=0) 
nor. RA,RS,RB (Rc=1) 


31 RS RA RB 124 Re 
0 6 11 16 21 31 


RA € ^((RS) | (RB)) 


The contents of register RS are ORed with the con- 
tents of register RB and the complemented result is 
placed into register RA. 


Special Registers Altered: 
CRO (if Rc=1) 


Extended Mnemonics: 
Example of extended mnemonics for NOR: 


Extended: 
not Rx,Ry nor 


Equivalent to: 
Rx,Ry,Ry 


AND with Complement X-form 


andc RA,RS,RB (Rc=0) 
andc. RA,RS,RB 


31 RS RA RB 60 Re 
0 6 11 16 21 31 


RA € (RS) & 7(RB) 


The contents of register RS are ANDed with the com- 
plement of the contents of register RB and the result 
is placed into register RA. 


Special Registers Altered: 
CRO (if Rc=1) 


Equivalent X-form 


eqv RA,RS,RB (Rc=0) 
eqv. RA,RS,RB (Rc=1) 


31 RS RA RB 284 Re 
0 6 11 16 21 31 


RA € (RS) = (RB) 


The contents of register RS are XORed with the con- 
tents of register RB and the complemented result is 
placed into register RA. 


Special Registers Altered: 
CRO (if Rc=1) 


OR with Complement X-form 


orc RA,RS,RB (Rc=0) 
orc. RA,RS,RB (Rc=1) 


31 RS RA RB 412 Re 
0 6 11 16 21 31 


RA € (RS) | 7(RB) 


The contents of register RS are ORed with the com- 
plement of the contents of register RB and the result 
is placed into register RA. 


Special Registers Altered: 
CRO (if Rc=1) 
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Extend Sign Byte X-form Extend Sign Halfword X-form 
extsb RA,RS (Rc=0) extsh RA,RS (Rc=0) 
extsb. RA,RS (Rc=1) extsh. RA,RS (Rc=1) 


[POWER mnemonics: exts, exts.] 


31 RS RA 111 954 Re 
0 6 11 16 21 31 31 RS RA Ill 922 Rc 
0 6 11 16 21 


31 


s € (RS) 56 
asse € (99) ssa S € (RS) 4g 
055 $75 RA48:63 € (RS) 48:63 
(RS)s6:63 are placed into RAsg.g3. Bit 56 of register RS RAg.47 € "s 
is placed into RÃg.ss- (RS)45,55 are placed into RA4g44. Bit 48 of register RS 


; ; is placed into RAg.47. 
Special Registers Altered: É 


CRO (if Re=1) Special Registers Altered: 
CRO (if Re=1) 


Extend Sign Word X-form 


extsw RA,RS (Rc=0) 

extsw. RA,RS (Re=1) 
31 RS RA 111 986 Re 

0 6 11 16 21 31 

s € (RS) 32 

RAgo.63 € (RS) 32:63 

Rios: € %s 


(RS)32.63 are placed into RA39.¢3. Bit 32 of register RS 
is placed into RA. 


Special Registers Altered: 
CRO (if Rc=1) 
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Count Leading Zeros Doubleword 
X-form 


cntizd RA,RS (Rc=0) 
cntizd. RA,RS (Re=1) 

31 RS RA Ill 58 Rc 
0 6 11 16 21 31 
n +0 


do while n < 64 
if (RS), = 1 then leave 
nten+1 

RA €n 


A count of the number of consecutive zero bits 
starting at bit O of register RS is placed into RA. This 
number ranges from 0 to 64, inclusive. 


If Rc=1, CR Field O is set to reflect the result. 


Special Registers Altered: 
CRO (if Rc=1) 


Count Leading Zeros Word X-form 


cntlzw 
cntlzw. 


RA,RS (Rc=0) 
RA,RS (Rc=1) 


[POWER mnemonics: cntlz, cntlz.] 


31 RS RA Il] 26 Rc 
0 6 11 16 21 31 


n € 32 

do while n < 64 
if (RS), = 1 then leave 
n*nt*1 

BA €n — 32 


A count of the number of consecutive zero bits 
starting at bit 32 of register RS is placed into RA. 
This number ranges from 0 to 32, inclusive. 

If Rc=1, CR Field O is set to reflect the result. 


Special Registers Altered: 
CRO (if Rc=1) 


Programming Note 


For both Count Leading Zeros instructions, if 
Rc=1 then LT is set to 0 in CR Field 0. 
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3.3.12 Fixed-Point Rotate and Shift Instructions 


The Fixed-Point Processor performs rotation oper- 
ations on data from a GPR and returns the result, or a 
portion of the result, to a GPR. 


The rotation operations rotate a 64-bit quantity left by 
a specified number of bit positions. Bits that exit from 
position O enter at position 63. 


Two types of rotation operation are supported. 


For the first type, denoted rotateg, or ROTL¢4, the 
value rotated is the given 64-bit value. The rotateg, 
operation is used to rotate a given 64-bit quantity. 


For the second type, denoted rotate3. or ROTL39, the 
value rotated consists of two copies of bits 32:63 of 
the given 64-bit value, one copy in bits 0:31 and the 
other in bits 32:63. The rotatez, operation is used to 
rotate a given 32-bit quantity. 


The Rotate and Shift instructions employ a mask gen- 
erator. The mask is 64 bits long, and consists of 
1-bits from a start bit, mstart, through and including a 
stop bit, mstop, and 0-bits elsewhere. The values of 
mstart and mstop range from O to 63. If mstart > 
mstop, the 1-bits wrap around from position 63 to 
position 0. Thus the mask is formed as follows: 


if mstart < mstop then 
MaSKmetart:mstop 7 Ones 
maskali other bits = Zeros 
else 
masKmstart:63 = Ones 
MASKg-mstop = Ones 
maskali other bits = Zeros 


There is no way to specify an all-zero mask. 


For instructions that use the rotate? operation, the 
mask start and stop positions are always in the low- 
order 32 bits of the mask. 


The use of the mask is described in following 
sections. 


The Rotate and Shift instructions with Rc=1 set the 
first three bits of CR field 0 as described in Section 
3.3.7, “Other Fixed-Point Instructions” on page 48. 
Rotate and Shift instructions do not change the OV 
and SO bits. Rotate and Shift instructions, except 
algebraic right shifts, do not change the CA bit. 


Extended mnemonics for rotates and 
shifts 


The Rotate and Shift instructions, while powerful, can 
be complicated to code (they have up to five oper- 
ands). A set of extended mnemonics is provided that 
allow simpler coding of often-used functions such as 
clearing the leftmost or rightmost bits of a register, 
left justifying or right justifying an arbitrary field, and 
performing simple rotates and shifts. Some of these 
are shown as examples with the Rotate instructions. 
See Appendix B, “Assembler Extended Mnemonics” 
on page 143 for additional extended mnemonics. 


3.3.12.1 Fixed-Point Rotate Instructions 


These instructions rotate the contents of a register. 
The result of the rotation is 


m inserted into the target register under control of a 
mask (if a mask bit is 1 the associated bit of the 
rotated data is placed into the target register, 
and if the mask bit is 0 the associated bit in the 
target register remains unchanged); or 


m ANDed with a mask before being placed into the 
target register. 


The Rotate Left instructions allow right-rotation of the 
contents of a register to be performed (in concept) by 
a left-rotation of 64—n, where n is the number of bits 
by which to rotate right. They allow right-rotation of 
the contents of the low-order 32 bits of a register to 
be performed (in concept) by a left-rotation of 32-n, 
where n is the number of bits by which to rotate right. 


Architecture Note 


For MD-form and MDS-form instructions, the MB 
and ME fields are used in permuted rather than 
sequential order because this is easier for the 
processor. Permuting the MB field permits the 
processor to obtain the low-order five bits of the 
MB value from the same place for all instructions 
having an MB field (M-form and MD-form 
instructions). Permuting the ME field permits the 
processor to treat bits 21:26 of all MD-form 
instructions uniformly. 
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Rotate Left Doubleword Immediate then 
Clear Left MD-form 


ridicl RA,RS,SH,MB 
ridicl. RA,RS,SH,MB 


30 RS RA sh mb 0 lshRa 
0 6 11 16 21 27 |30 


n € shs | sho.4 

r € ROTLg4((RS), n) 
b« mbs | 353. 

m € MASK(b, 63) 

RA trem 


The contents of register RS are rotatedg, left SH bits. 
A mask is generated having 1-bits from bit MB 
through bit 63 and 0-bits elsewhere. The rotated data 
are ANDed with the generated mask and the result is 
placed into register RA. 


Special Registers Altered: 
CRO (if Rc=1) 
Extended Mnemonics: 


Examples of extended mnemonics for Rotate Left 
Doubleword Immediate then Clear Left: 


Extended: Equivalent to: 

extrdi Rx,Ry,n,b ridicl Rx,Ry,b+n,64-n 
srdi Rx,Ry,n ridicl Rx,Ry,64—n,n 
clridi Rx,Ry,n ridicl Rx,Ry,0,n 


Rotate Left Doubleword Immediate then 
Clear Right MD-form 


ridicr RA,RS,SH,ME 
ridicr. RA,RS,SH,ME 


30 RS RA sh me 1 |shiRa 
0 6 11 16 21 27 |30 


n € shs | sho. 

r € ROTLg4((RS), n) 
e € mes meda 

m € MASK(0, e) 

RA €r ¿ m 


The contents of register RS are rotatedg, left SH bits. 
A mask is generated having 1-bits from bit O through 
bit ME and 0-bits elsewhere. The rotated data are 
ANDed with the generated mask and the result is 
placed into register RA. 


Special Registers Altered: 
CRO (if Rc=1) 
Extended Mnemonics: 


Examples of extended mnemonics for Rotate Left 
Doubleword Immediate then Clear Right: 


Extended: Equivalent to: 

extldi Rx,Ry,n,b ridicr Rx,Ry,b,n— 1 
sldi Rx,Ry,n ridicr Rx,Ry,n,63-n 
clrrdi Rx,Ry,n ridicr Rx,Ry,0,63-n 


Programming Note 


ridicl can be used to extract an n-bit field that 
staris at bit position b in register RS, right- 
justified into register RA (clearing the remaining 
64-n bits of RA), by setting SH=b+n and 
MB=64-n. It can be used to rotate the contents 
of a register left (right) by n bits, by setting SH=n 
(64—n) and MB=0. It can be used to shift the 
contents of a register right by n bits, by setting 
SH=64-n and MB=n. It can be used to clear the 
high-order n bits of a register, by setting SH=0 
and MB=n. 


Extended mnemonics are provided for all of these 
uses; see Appendix B, “Assembler Extended 
Mnemonics” on page 143. 


Programming Note 


rldicr can be used to extract an n-bit field that 
starts at bit position b in register RS, left-justified 
into register RA (clearing the remaining 64—n bits 
of RA), by setting SH=b and ME=n-1. It can be 
used to rotate the contents of a register left 
(right) by n bits, by setting SH=n (64-n) and 
ME=63. lt can be used to shift the contents of a 
register left by n bits, by setting SH=n and 
ME=63-n. It can be used to clear the low-order 
n bits of a register, by setting SH=0 and 
ME=63-n. 


Extended mnemonics are provided for all of these 
uses (some devolve to rldicl); see Appendix B, 
"Assembler Extended Mnemonics" on page 143. 
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Rotate Left Doubleword Immediate then 
Clear MD-form 


ridic RA,RS,SH,MB 
ridic. RA,RS,SH,MB 


30 RS RA sh mb 2 IshRg 
0 6 11 16 21 27 |30 


n € shs | sho:4 

r ¢ ROTLg4((RS), n) 
b« mbs boa 

m € MASK(b, ^n) 
RA«r&m 


The contents of register RS are rotatedg, left SH bits. 
A mask is generated having 1-bits from bit MB 
through bit 63- SH and 0-bits elsewhere. The rotated 
data are ANDed with the generated mask and the 
result is placed into register RA. 


Special Registers Altered: 
CRO (if Rc=1) 
Extended Mnemonics: 


Example of extended mnemonics for Rotate Left 
Doubleword Immediate then Clear: 


Extended: Equivalent to: 
cirisidi Rx,Ry,b,n ridic  Rx,Ry,n,b-n 


Programming Note 


ridic can be used to clear the high-order b bits of 
the contents of a register and then shift the result 
left by n bits, by setting SH=n and MB=b-n. It 
can be used to clear the high-order n bits of a 
register, by setting SH=0 and MB=n. 


Extended mnemonics are provided for both of 
these uses (the second devolves to rldicl); see 
Appendix B, "Assembler Extended Mnemonics" 
on page 143. 


Rotate Left Word Immediate then AND 
with Mask M-form 


rlwinm RA,RS,SH,MB,ME (Rc=0) 
rlwinm. RA,RS,SH,MB,ME (Rc=1) 
[POWER mnemonics: rlinm, rlinm.] 

21 RS RA SH MB ME |Rc 
0 6 11 16 21 26 31 
n € SH 


rt ROT Lao ( (RS) 39-63, n) 
m € MASK(MB+32, ME+32) 
RA tram 


The contents of register RS are rotated, left SH bits. 
A mask is generated having 1-bits from bit MB+32 
through bit ME+32 and 0-bits elsewhere. The rotated 
data are ANDed with the generated mask and the 
result is placed into register RA. 


Special Registers Altered: 
CRO (if Rc=1) 
Extended Mnemonics: 


Examples of extended mnemonics for Rotate Left 
Word Immediate then AND with Mask: 


Extended: 

extlwi Rx,Ry,n,b 
srwi Rx,Ry,n 
clrrwi Rx,Ry,n 


Equivalent to: 

rlwinm Rx, Ry,b,0,n- 1 
rlwinm Rx,Ry,32- n,n,31 
rlwinm Rx,Ry,0,0,31- n 
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Programming Note 


Let RSL represent the low-order 32 bits of reg- 
ister RS, with the bits numbered from O through 
31. 


rlwinm can be used to extract an n-bit field that 
starts at bit position b in RSL, right-justified into 
the low-order 32 bits of register RA (clearing the 
remaining 32—n bits of the low-order 32 bits of 
RA), by setting SH=b+n, MB=32-n, and 
ME=31. It can be used to extract an n-bit field 
that starts at bit position b in RSL, left-justified 
into the low-order 32 bits of register RA (clearing 
the remaining 32- n bits of the low-order 32 bits of 
RA), by setting SH=b, MB = 0, and ME=n-1. It 
can be used to rotate the contents of the low- 
order 32 bits of a register left (right) by n bits, by 
setting SH=n (32-n), MB=0, and ME=31. It can 
be used to shift the contents of the low-order 32 
bits of a register right by n bits, by setting 
SH=32-n, MB=n, and ME=31. It can be used to 
clear the high-order b bits of the low-order 32 bits 
of the contents of a register and then shift the 
result left by n bits, by setting SH=n, MB=b-n, 
and ME=31-n. It can be used to clear the low- 
order n bits of the low-order 32 bits of a register, 
by setting SH=0, MB=0, and ME=31-n. 


For all the uses given above, the high-order 32 
bits of register RA are cleared. 


Extended mnemonics are provided for all of these 
uses; see Appendix B, “Assembler Extended 
Mnemonics” on page 148. 


Rotate Left Doubleword then Clear Left 
MDS-form 


ridcl RA,RS,RB,MB (Rc-0) 

ridel. RA,RS,RB,MB (Re=1) 
30 RS RA RB mb 8 |Rc 

0 6 11 16 21 27 |a 

n € (RB) 58:63 

r € ROTLgq( (RS), n) 

b ¢ mbs i mbo:4 

m ¢ MASK(b, 63) 

RA tram 


The contents of register RS are rotatedg, left the 
number of bits specified by (RB)sggs. A mask is gen- 
erated having 1-bits from bit MB through bit 63 and 
O-bits elsewhere. The rotated data are ANDed with 
the generated mask and the result is placed into reg- 
ister RA. 


Special Registers Altered: 

CRO (if Rc=1) 
Extended Mnemonics: 
Example of extended mnemonics for Rotate Left 
Doubleword then Clear Left: 


Extended: Equivalent to: 
rotid Rx,Ry,Rz ridcl Rx,Ry,Rz,0 


Programming Note 


rldcl can be used to extract an n-bit field that 
starts at variable bit position b in register RS, 
right-justified into register RA (clearing the 
remaining 64-n bits of RA), by setting 
RBeg.gg=b +n and MB=64-n. It can be used to 
rotate the contents of a register left (right) by var- 
iable n bits, by setting RBss.eg=n (64-n) and 
MB=0. 


Extended mnemonics are provided for some of 
these uses; see Appendix B, “Assembler 
Extended Mnemonics” on page 143. 
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Rotate Left Doubleword then Clear Right 
MDS-form 


rider RA,RS,RB,ME (Rc=0) 

rider. RA,RS,RB,ME (Rc=1) 
30 RS RA RB me 9 |Rc 

0 6 11 16 21 27 |31 

n € (RB) 58:53 

r € ROTLg4((RS), n) 

e € mes ¡el 

m € MASK(0, e) 

RA tram 


The contents of register RS are rotatedg, left the 
number of bits specified by (RB)sg.g3. A mask is gen- 
erated having 1-bits from bit O through bit ME and 
O-bits elsewhere. The rotated data are ANDed with 
the generated mask and the result is placed into reg- 
ister RA. 


Special Registers Altered: 
CRO (if Rc=1) 


Programming Note 


rldcr can be used to extract an n-bit field that 
starts at variable bit position b in register RS, left- 
justified into register RA (clearing the remaining 
64-n bits of RA), by setting RBss.sg=b and 
ME=n-1. It can be used to rotate the contents of 
a register left (right) by variable n bits, by setting 
RBss:63= n (64— n) and ME=63. 


Extended mnemonics are provided for some of 
these uses (some devolve to rldch; see 
Appendix B, “Assembler Extended Mnemonics” 
on page 143. 


Rotate Left Word then AND with Mask 
M-form 


rlwnm RA,RS,RB,MB,ME (Rcz0) 
rlwnm. RA,RS,RB,MB,ME (Rc=1) 
[POWER mnemonics: rlnm, rinm.] 

23 RS RA RB MB ME |Rc 
0 6 11 16 21 26 31 
n € (RB) 59:63 


rt ROT Lao ( (RS) 39-63, n) 
m € MASK(MB+32, ME+32) 
RA tram 


The contents of register RS are rotateds> left the 
number of bits specified by (RB)so.gs. A mask is gen- 
erated having 1-bits from bit MB+32 through bit 
ME+32 and 0-bits elsewhere. The rotated data are 
ANDed with the generated mask and the result is 
placed into register RA. 


Special Registers Altered: 

CRO (if Rc=1) 
Extended Mnemonics: 
Example of extended mnemonics for Rotate Left Word 
then AND with Mask: 


Extended: 
rotlw — Rx; Ry,Rz 


Equivalent to: 
rlwnm Rx,Ry,Rz,0,31 


Programming Note 


Let RSL represent the low-order 32 bits of reg- 
ister RS, with the bits numbered from O through 
31. 


rlwnm can be used to extract an n-bit field that 
starts at variable bit position b in RSL, right- 
justified into the low-order 32 bits of register RA 
(clearing the remaining 32—n bits of the low-order 
32 bits of RA), by setting RBsgeg=D+n, 
MB=32-n, and ME=31. It can be used to extract 
an n-bit field that starts at variable bit position b 
in RSL, left-justified into the low-order 32 bits of 
register RA (clearing the remaining 32-n bits of 
the low-order 32 bits of RA), by setting RBsg.gs- b, 
MB = 0, and ME=n-1. It can be used to rotate 
the contents of the low-order 32 bits of a register 
left (right) by variable n bits, by setting RBsg.3= n 
(32- n), MB=0, and ME=31. 


For all the uses given above, the high-order 32 
bits of register RA are cleared. 


Extended mnemonics are provided for some of 
these uses; see Appendix B, "Assembler 
Extended Mnemonics" on page 143. 


72 PowerPC AS User Instruction Set Architecture, Tags Inactive Subset 


IBM Confidential - Feb. 24, 1999 


Rotate Left Doubleword Immediate then 
Mask Insert MD-form 


RA,RS,SH,MB 
RA,RS,SH,MB 


ridimi 
ridimi. 


30 RS RA sh mb 3 ishRq 
0 6 11 16 21 27 |30 


n € shs | sho4 
r € ROTLg4((RS), n) 
b« mbs nba 
m € MASK (b, ^n) 
RA € ram | (RAjém 


The contents of register RS are rotatedg, left SH bits. 
A mask is generated having 1-bits from bit MB 
through bit 63- SH and 0-bits elsewhere. The rotated 
data are inserted into register RA under control of the 
generated mask. 


Special Registers Altered: 
CRO (if Rc=1) 
Extended Mnemonics: 


Example of extended mnemonics for Rotate Left 
Doubleword Immediate then Mask Insert: 


Extended: 
insrdi Rx,Ry,n,b 


Equivalent to: 
ridimi Rx,Ry,64- (b+n),b 


Programming Note 


rldimi can be used to insert an n-bit field that is 
right-justified in register RS, into register RA 
starting at bit position b, by setting 
SH=64-(b+n) and MB=b. 


An extended mnemonic is provided for this use; 
see Appendix B, “Assembler Extended 


Mnemonics” on page 143. 


Rotate Left Word Immediate then Mask 
Insert M-form 


rlwimi RA,RS,SH,MB,ME (Rc=0) 
rlwimi. RA,RS,SH,MB,ME (Rc=1) 
[POWER mnemonics: rlimi, rlimi.] 

20 RS RA SH MB ME [Re 
0 6 11 16 21 26 31 
n € SH 


rt ROT Lago ( (RS) 39-63, n) 

m € MASK(MB+32, ME+32) 

RA € ram | (RAjém 

The contents of register RS are rotateds, left SH bits. 
A mask is generated having 1-bits from bit MB+32 
through bit ME+32 and 0-bits elsewhere. The rotated 
data are inserted into register RA under control of the 
generated mask. 


Special Registers Altered: 

CRO (if Rc=1) 
Extended Mnemonics: 
Example of extended mnemonics for Rotate Left Word 
Immediate then Mask Insert: 


Extended: 
inslwi Rx,Ry,n,b 


Equivalent to: 
rlwimi Rx,Ry,32-b,b,b+n- 1 


Programming Note 


Let RAL represent the low-order 32 bits of reg- 
ister RA, with the bits numbered from O through 
31. 


rlwimi can be used to insert an n-bit field that is 
left-justified in the low-order 32 bits of register 
RS, into RAL starting at bit position b, by setting 
SH232- b, MB=b, and ME=(b+n)-1. It can be 
used to insert an n-bit field that is right-justified in 
the low-order 32 bits of register RS, into RAL 
starting at bit position b, by setting 
SH=32-(b+n), MB=b, and ME=(b+n)-1. 


Extended mnemonics are provided for both of 
these uses; see Appendix B, “Assembler 
Extended Mnemonics” on page 143. 
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3.3.12.2 Fixed-Point Shift Instructions 


The instructions in this section perform left and right 
shifts. 


Extended mnemonics for shifts 


Immediate-form logical (unsigned) shift operations are 
obtained by specifying appropriate masks and shift 
values for certain Rotate instructions. A set of 
extended mnemonics is provided to make coding of 
such shifts simpler and easier to understand. Some 
of these are shown as examples with the Rotate 
instructions. See Appendix B, “Assembler Extended 
Mnemonics” on page 143 for additional extended 
mnemonics. 


Shift Left Doubleword X-form 


sld RA,RS,RB (Rc-0) 
sld. RA,RS,RB (Re=1) 

31 RS RA RB 27 Re 
0 6 11 16 21 31 
n € (RB) 58:63 


rt ROTLg4 ( (RS), n) 
if (RB) 57 = 0 then 
m € MASK(0, 63-n) 
else m € 940 
RA t€rém 


The contents of register RS are shifted left the 
number of bits specified by (RB)s7.63. Bits shifted out 
of position 0 are lost. Zeros are supplied to the 
vacated positions on the right. The result is placed 
into register RA. Shift amounts from 64 to 127 give a 
zero result. 


Special Registers Altered: 
CRO (if Rc=1) 


Programming Note 


Any Shift Right Algebraic instruction, followed by 
addze, can be used to divide quickly by 2". The 
setting of the CA bit by the Shift Right Algebraic 
instructions is independent of mode. 


Programming Note 


Multiple-precision shifts can be programmed as 
shown in Section C.1, “Multiple-Precision Shifts” 
on page 155. 


Engineering Note 


The instructions intended for use with 32-bit data 
are shown as doing a rotatez, operation. This is 
strictly necessary only for setting the CA bit for 
srawi and sraw. slw and srw could do a rotateg, 
operation if that is easier. 


Shift Left Word X-form 


slw RA,RS,RB (Rc=0) 
slw. RA,RS,RB (Rc=1) 
[POWER mnemonics: sl, sl.] 

31 RS RA RB 24 Re 
0 6 11 16 21 31 


n € (RB) 59:63 
rt ROT Lao ( (RS) 32:637 n) 
if (RB) 58 = 0 then 

m € MASK(32, 63-n) 
else m € 940 
RA«r&m 


The contents of the low-order 32 bits of register RS 
are shifted left the number of bits specified by 
(RB)sg:3 Bits shifted out of position 32 are lost. 
Zeros are supplied to the vacated positions on the 
right. The 32-bit result is placed into RAs5.63. RAg:31 
are set to zero. Shift amounts from 32 to 63 give a 
zero result. 


Special Registers Altered: 
CRO (if Rc=1) 
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Shift Right Doubleword X-form 


srd RA,RS,RB (Rc-0) 
srd. RA,RS,RB (Rc-1) 

31 RS RA RB 539 Rc 
0 6 11 16 21 31 


r € ROTLg4((RS), 642) 
if (RB) 57 = 0 then 

m € MASK(n, 63) 
else m € 640 
RA t€rém 


The contents of register RS are shifted right the 
number of bits specified by (RB)s7.63. Bits shifted out 
of position 63 are lost. Zeros are supplied to the 
vacated positions on the left. The result is placed into 
register RA. Shift amounts from 64 to 127 give a zero 
result. 


Special Registers Altered: 
CRO (if Rc=1) 


Shift Right Word X-form 


srw RA,RS,RB 
srw. RA,RS,RB (Rc=1) 


[POWER mnemonics: sr, sr.] 


31 RS RA RB 536 Re 
0 6 11 16 21 31 


n € (RB) 59:63 
rt ROT Lago ( (RS) 39-63, 64-n) 
if (RB) 58 = 0 then 

m € MASK(n*32, 63) 
else m € 940 
RA«r&m 


The contents of the low-order 32 bits of register RS 
are shifted right the number of bits specified by 
(RB)ss:63- Bits shifted out of position 63 are lost. 
Zeros are supplied to the vacated positions on the 
left. The 32-bit result is placed into RAs5.63. RAg:3 
are set to zero. Shift amounts from 32 to 63 give a 
zero result. 


Special Registers Altered: 
CRO (if Re=1) 
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Shift Right Algebraic Doubleword 
Immediate XS-form 


sradi RA,RS,SH (Rc=0) 
sradi. RA,RS,SH (Re=1) 


31 RS RA sh 413 sh|Rq 
0 6 11 16 21 30| 31 


n € shs | sho.4 

r € ROTLg4((RS), 64—n) 
m € MASK (n, 63) 

s € (RS)o 

RA € ram | (%s)6m 
CA € s & ((r&7m) #0) 


The contents of register RS are shifted right SH bits. 
Bits shifted out of position 63 are lost. Bit O of RS is 
replicated to fill the vacated positions on the left. The 
result is placed into register RA. CA is set to 1 if (RS) 
is negative and any 1-bits are shifted out of position 
63; otherwise CA is set to 0. A shift amount of zero 
causes RA to be set equal to (RS), and CA to be set 
to 0. 


Special Registers Altered: 
CA 
CRO (if Rc=1) 
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Shift Right Algebraic Word Immediate 
X-form 


srawi RA,RS,SH (Rc=0) 
srawi. RA,RS,SH (Re=1) 
[POWER mnemonics: srai, srai.] 

31 RS RA SH 824 Re 
0 6 11 16 21 31 


n € SH 

rt ROT Lao ( (RS) 39-63, 64-n) 
m € MASK (n+32, 63) 

s € (RS) 3 

RÀ € r&m 7 (845) &m 

CA €5 & ( (r&7m) 32.5370) 


The contents of the low-order 32 bits of register RS 
are shifted right SH bits. Bits shifted out of position 
63 are lost. Bit 32 of RS is replicated to fill the 
vacated positions on the left. The 32-bit result is 
placed into RAs5,,4. Bit 32 of RS is replicated to fill 
RAg:31- CA is set to 1 if the low-order 32 bits of (RS) 
contain a negative number and any 1-bits are shifted 
out of position 63; otherwise CA is set to 0. A shift 
amount of zero causes RA to receive EXTS((RS)s2.69). 
and CA to be set to 0. 


Special Registers Altered: 
CA 
CRO (if Rc=1) 
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Shift Right Algebraic Doubleword 
X-form 


srad RA,RS,RB (Rc=0) 
srad. RA,RS,RB (Re=1) 

31 RS RA RB 794 Re 
0 6 11 16 21 31 
n € (RB) 58:53 


r € ROTLea( (RS), 64-n) 
if (RB) 57 = 0 then 

m € MASK(n, 63) 
else m € 640 
s € (RS)o 
RA € ram | (%s)6m 
CA € s & ((r&7m) #0) 


The contents of register RS are shifted right the 
number of bits specified by (RB)s7.63. Bits shifted out 
of position 63 are lost. Bit 0 of RS is replicated to fill 
the vacated positions on the left. The result is placed 
into register RA. CA is set to 1 if (RS) is negative and 
any 1-bits are shifted out of position 63; otherwise CA 
is setto 0. A shift amount of zero causes RA to be 
set equal to (RS), and CA to be set to O. Shift 
amounts from 64 to 127 give a result of 64 sign bits in 
RA, and cause CA to receive the sign bit of (RS). 


Special Registers Altered: 
CA 
CRO (if Rc=1) 


Shift Right Algebraic Word X-form 


sraw RA,RS,RB 
sraw. RA,RS,RB 


[POWER mnemonics: sra, sra.] 


31 RS RA RB 792 Re 
0 6 11 16 21 31 


n € (RB) 59:63 
rt ROT Lao ( (RS) 39-63, 64-n) 
if (RB) 58 = 0 then 
m € MASK(n+32, 63) 
else m € 940 
s € (RS)3 
RÀ € r&m i (845) &m 
CA €5 & ( (r&7m) 32.5370) 


The contents of the low-order 32 bits of register RS 
are shifted right the number of bits specified by 
(RB)sggs. Bits shifted out of position 63 are lost. Bit 
32 of RS is replicated to fill the vacated positions on 
the left. The 32-bit result is placed into RAs,63. Bit 
32 of RS is replicated to fill RAo.44. CA is set to 1 if 
the low-order 32 bits of (RS) contain a negative 
number and any 1-bits are shifted out of position 63; 
otherwise CA is set to O. A shift amount of zero 
causes RA to receive EXTS((RS)32.63), and CA to be 
set to O. Shift amounts from 32 to 63 give a result of 
64 sign bits, and cause CA to receive the sign bit of 
(RS) 32-63: 


Special Registers Altered: 
CA 
CRO (if Rc=1) 
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3.3.13 Move To/From System Register Instructions 


The Move To Condition Register Fields instruction has 
a preferred form; see Section 1.9.1, “Preferred 
Instruction Forms" on page 12. In the preferred form, 
the FXM field satisfies the following rule. 


m Exactly one bit of the FXM field is set to 1. 


Extended mnemonics 


Extended mnemonics are provided for the mtspr and 
mfspr instructions so that they can be coded with the 


Move To Special Purpose Register 
XFX-form 


mtspr SPR,RS 


0 6 11 21 31 


n € sprgg | spros 

if length(SPREG(n)) = 64 then 
SPREG(n) € (RS) 

else 
SPREG(n) € (RS) 30:63 


The SPR field denotes a Special Purpose Register, 
encoded as shown in the table below. The contents of 
register RS are placed into the designated Special 
Purpose Register. For Special Purpose Registers that 
are 32 bits long, the low-order 32 bits of RS are 
placed into the SPR. 


SPR' Register 
Sp'59 Spro; Name 
00000 00001 XER 
00000 01000 LR 


decimal 


00000 01001 CTR 


* Note that the order of the two 5-bit 
halves of the SPR number is reversed. 


If the SPR field contains any value other than one of 
the values shown above then one of the following 
occurs. 


m The system illegal instruction error handler is 
invoked. 

m The system privileged instruction error handler is 
invoked. 

m The results are boundedly undefined. 


A complete description of this instruction can be 
found in Book Ill, PowerPC AS Operating Environment 
Architecture. 


Special Registers Altered: 
See above 


SPR name as part of the mnemonic rather than as a 
numeric operand. An extended mnemonic is provided 
for the mterf instruction for compatibility with old soft- 
ware (written for a version of the architecture that 
precedes Version 2.00) that uses it to set the entire 
Condition Register. Some of these extended mne- 
monics are shown as examples with the relevant 
instructions. See Appendix B, “Assembler Extended 
Mnemonics” on page 143 for additional extended 
mnemonics. 


Extended Mnemonics: 


Examples of extended mnemonics for Move To 
Special Purpose Register: 


Extended: Equivalent to: 
mtxer Rx mtspr 1,Rx 
mtlr Rx mtspr 8,Rx 
mtctr Rx mtspr 9,Rx 


Compiler and Assembler Note 


For the mtspr and mfspr instructions, the SPR 
number coded in assembler language does not 
appear directly as a 10-bit binary number in the 
instruction. The number coded is split into two 
5-bit halves that are reversed in the instruction, 
with the high-order 5 bits appearing in bits 16:20 
of the instruction and the low-order 5 bits in bits 
11:15. This maintains compatibility with POWER 
SPR encodings, in which these two instructions 
have only a 5-bit SPR field occupying bits 11:15. 


Compatibility Note 


For a discussion of POWER compatibility with 
respect to SPR numbers not shown in the instruc- 
tion descriptions for mtspr and mfspr, see 
Appendix E, “Incompatibilities with the POWER 
Architecture” on page 163. 


Engineering Note 


If MSRpp=1, the only effect of executing this 
instruction with an SPR number in which spro= 1 
is to cause either an Illegal Instruction type 
Program interrupt or a Privileged Instruction type 
Program interrupt. 


Engineering Note 


Any assignment of SPR numbers not shown in the 
Book | instruction descriptions for mtspr and 
mfspr must be done in a manner consistent with 
the section that describes these instructions in 
Book III. 
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Move From Special Purpose Register 


XFX-form 
mfspr RT,SPR 

31 RT spr 339 / 
0 6 11 21 31 


n €sprsg | Spro4 

if length(SPREG(n)) = 64 then 
RT 4 SPREG (n) 

else 
RT € 320 | SPREG(n) 


The SPR field denotes a Special Purpose Register, 
encoded as shown in the table below. The contents of 
the designated Special Purpose Register are placed 
into register RT. For Special Purpose Registers that 
are 32 bits long, the low-order 32 bits of RT receive 
the contents of the Special Purpose Register and the 
high-order 32 bits of RT are set to zero. 


SPR' Register 
decimal ^ sprsg Spro4 Name 


00000 00001 XER 
00000 01000 LR 


00000 01001 CTR 


* Note that the order of the two 5-bit 
halves of the SPR number is reversed. 


If the SPR field contains any value other than one of 
the values shown above then one of the following 


occurs. 

m The system illegal instruction error handler is 
invoked. 

m The system privileged instruction error handler is 
invoked. 


m The results are boundedly undefined. 


A complete description of this instruction can be 
found in Book Ill, PowerPC AS Operating Environment 
Architecture. 


Special Registers Altered: 
None 
Extended Mnemonics: 


Examples of extended mnemonics for Move From 
Special Purpose Register: 


Extended: Equivalent to: 
mixer Rx mfspr Rx 
mflr Rx mfspr Rx,8 
mfctr Rx mfspr Rx,9 


Note 
See the Notes that appear with mtspr. 
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Move To Condition Register Fields 
XFX-form 


mtcrf FXM,RS 


31 RS |0 FXM / 144 / 
0 6 11] 12 20/21 31 


mask € ^(FXMg) | 4(FXM,) |... 
CR + ( (RS) 32:63 & mask) 


^ (FXM7) 
(CR & mask) 


The contents of bits 32:63 of register RS are placed 
into the Condition Register under control of the field 
mask specified by FXM. The field mask identifies the 
4-bit fields affected. Let i be an integer in the range 
0-7. If FXM¡=1 then CR field i (CR bits 4xi:4xi+3) is 
set to the contents of the corresponding field of the 
low-order 32 bits of RS. 


Special Registers Altered: 
CR fields selected by mask 
Extended Mnemonics: 
Example of extended mnemonics for Move To Condi- 
tion Register Fields: 


Extended: Equivalent to: 
mtcr Rx mtcrf OxFF,Rx 


Programming Note 


In the preferred form of this instruction (see the 
introduction to Section 3.3.13), only one Condition 
Register field is updated. 


Engineering Note 


See the description of the optional version of 
mtcrf in Section 5.1.1 for additional information 
about this instruction. 


Move to Condition Register from XER 
X-form 


mcrxr BF 


31 BF |//| 4 Il 512 / 
0 6 9 |11 16 21 31 


CR4xBF:4xBF+3 € XER32:35 
XER32:35 € 0b0000 


The contents of XERz>.35 are copied to Condition Reg- 
ister field BF. XERgo.35 are set to zero. 


Special Registers Altered: 
CR field BF XER 55.35 


Move From Condition Register 


XFX-form 
mfcr RT 

31 RT |0 Hl 19 / 
0 6 11112 21 31 
RT € 920 | CR 


The contents of the Condition Register are placed into 
RTa23. RT 9-3; are set to 0. 


Special Registers Altered: 
None 


Engineering Note 


See the description of the optional version of mfcr 
in Section 5.1.1 for additional information about 
this instruction. 
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Chapter 4. Floating-Point Processor 


4.1 Floating-Point Processor Overview 81 
4.2 Floating-Point Processor Registers 82 
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4.3.4 Normalization and 
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4.4 Floating-Point Exceptions ...... 89 
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4.1 Floating-Point Processor 
Overview 


This chapter describes the registers and instructions 
that make up the Floating-Point Processor facility. 
Section 4.2, “Floating-Point Processor Registers” on 
page 82 describes the registers associated with the 
Floating-Point Processor. Section 4.6, “Floating-Point 
Processor Instructions” on page 97 describes the 
instructions associated with the Floating-Point 
Processor. 


4.4.5.1 Definition 
4.4.5.2 Action 
4.5 Floating-Point Execution Models .. 94 
4.5.1 Execution Model for IEEE 


Operations ................ 95 
4.5.2 Execution Model for Multiply-Add 

Type Instructions ............. 96 
4.6 Floating-Point Processor 

Instructions ................ 97 
4.6.1 Floating-Point Storage Access 

Instructions ................ 98 


4.6.1.1 Storage Access Exceptions ... 98 
4.6.2 Floating-Point Load Instructions . 98 
4.6.3 Floating-Point Store Instructions 101 
4.6.4 Floating-Point Move Instructions 105 
4.6.5 Floating-Point Arithmetic 


Instructions ............... 106 
4.6.5.1 Floating-Point Elementary 

Arithmetic Instructions ........ 106 
4.6.5.2 Floating-Point Multiply-Add 

Instructions ............... 108 


4.6.6 Floating-Point Rounding and 
Conversion Instructions ........ 110 
4.6.7 Floating-Point Compare 


Instructions ............... 114 
4.6.8 Floating-Point Status and Control 
Register Instructions ......... 115 


This architecture specifies that the processor imple- 
ment a floating-point system as defined in ANSI/IEEE 
Standard 754-1985, “IEEE Standard for Binary 
Floating-Point Arithmetic” (hereafter referred to as 
“the IEEE standard”), but requires software support in 
order to conform fully with that standard. That 
standard defines certain required “operations” (addi- 
tion, subtraction, etc.); the term “floating-point opera- 
tion” is used in this chapter to refer to one of these 
required operations, or to the operation performed by 
one of the Multiply-Add or Reciprocal Estimate 
instructions. All floating-point operations conform to 
that standard. 
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Instructions are provided to perform arithmetic, 
rounding, conversion, comparison, and other oper- 
ations in floating-point registers; to move floating- 
point data between storage and these registers; and 
to manipulate the Floating-Point Status and Control 
Register explicitly. 


These instructions are divided into two categories. 


m computational instructions 

The computational instructions are those that 
perform addition, subtraction, multiplication, divi- 
sion, extracting the square root, rounding, con- 
version, comparison, and combinations of these 
operations. These instructions provide the float- 
ing-point operations. They place status informa- 
tion into the Floating-Point Status and Control 
Register. They are the instructions described in 
Sections 4.6.5 through 4.6.7 and Section 5.2.1. 


m non-computational instructions 

The non-computational instructions are those that 
perform loads and stores, move the contents of a 
floating-point register to another floating-point 
register possibly altering the sign, manipulate the 
Floating-Point Status and Control Register explic- 
itly, and select the value from one of two float- 
ing-point registers based on the value in a third 
floating-point register. The operations performed 
by these instructions are not considered floating- 
point operations. With the exception of the 
instructions that manipulate the Floating-Point 
Status and Control Register explicitly, they do not 
alter the Floating-Point Status and Control Reg- 
ister. They are the instructions described in 
Sections 4.6.2 through 4.6.4, 4.6.8, and 5.2.2. 


A floating-point number consists of a signed exponent 
and a signed significand. The quantity expressed by 
this number is the product of the significand and the 
number 29*Ponent Encodings are provided in the data 
format to represent finite numeric values, z Infinity, 
and values that are "Not a Number" (NaN). Oper- 
ations involving infinities produce results obeying tra- 
ditional mathematical conventions. NaNs have no 
mathematical interpretation. Their encoding permits 
a variable diagnostic information field. They may be 
used to indicate such things as uninitialized variables 
and can be produced by certain invalid operations. 


There is one class of exceptional events that occur 
during instruction execution that is unique to the 
Floating-Point Processor: the Floating-Point Exception. 
Floating-point exceptions are signaled with bits set in 
the Floating-Point Status and Control Register 
(FPSCR). They can cause the system floating-point 
enabled exception error handler to be invoked, pre- 
cisely or imprecisely, if the proper control bits are set. 


Floating-Point Exceptions 


The following floating-point exceptions are detected 
by the processor: 


m Invalid Operation Exception (VX 
SNaN (VXSNAN 
Infinity— Infinity (VXISI 
Infinity = Infinity (VXIDI 
Zero- Zero (VXZDZ 


InfinityxZero 
Invalid Compare 
Software Request (VXSOFT 


Invalid Square Root (VXSQRT 
Invalid Integer Convert (VXCVI 
m Zero Divide Exception (ZX 
m Overflow Exception (OX 
m Underflow Exception (UX 
m Inexact Exception (XX 


Each floating-point exception, and each category of 
Invalid Operation Exception, has an exception bit in 
the FPSCR. In addition, each floating-point exception 
has a corresponding enable bit in the FPSCR. See 
Section 4.2.2, “Floating-Point Status and Control 
Register” on page 83 for a description of these 
exception and enable bits, and Section 4.4, “Floating- 
Point Exceptions” on page 89 for a detailed dis- 
cussion of floating-point exceptions, including the 
effects of the enable bits. 


4.2 Floating-Point Processor 
Registers 


4.2.1 Floating-Point Registers 


Implementations of this architecture provide 32 float- 
ing-point registers (FPRs). The floating-point instruc- 
tion formats provide 5-bit fields for specifying the 
FPRs to be used in the execution of the instruction. 
The FPRs are numbered 0-31. See Figure 26 on 
page 83. 


Each FPR contains 64 bits that support the floating- 
point double format. Every instruction that interprets 
the contents of an FPR as a floating-point value uses 
the floating-point double format for this interpretation. 


The computational instructions, and the Move and 
Select instructions, operate on data located in FPRs 
and, with the exception of the Compare instructions, 
place the result value into an FPR and optionally 
place status information into the Condition Register. 


Load Double and Store Double instructions are pro- 
vided that transfer 64 bits of data between storage 
and the FPRs with no conversion. Load Single 
instructions are provided to transfer and convert 
floating-point values in floating-point single format 
from storage to the same value in floating-point 
double format in the FPRs. Store Single instructions 
are provided to transfer and convert floating-point 
values in floating-point double format from the FPRs 
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to the same value in floating-point single format in 
storage. 


Instructions are provided that manipulate the 
Floating-Point Status and Control Register and the 
Condition Register explicitly. Some of these 
instructions copy data from an FPR to the Floating- 
Point Status and Control Register or vice versa. 


The computational instructions and the Select instruc- 
tion accept values from the FPRs in double format. 
For single-precision arithmetic instructions, all input 
values must be representable in single format; if they 
are not, the result placed into the target FPR, and the 
setting of status bits in the FPSCR and in the Condi- 
tion Register (if Rc=1), are undefined. 


Figure 26. Floating-Point Registers 


4.2.2 Floating-Point Status and 
Control Register 


The Floating-Point Status and Control Register 
(FPSCR) controls the handling of floating-point 
exceptions and records status resulting from the float- 
ing-point operations. Bits 0:23 are status bits. Bits 
24:31 are control bits. 


The exception bits in the FPSCR (bits 3:12, 21:23) are 
sticky; that is, once set to 1 they remain set to 1 until 
they are set to 0 by an merfs, mtfsfi, mtfsf, or mtfsb0 
instruction. The exception summary bits in the FPSCR 
(FX, FEX, and VX, which are bits 0:2) are not consid- 
ered to be “exception bits”, and only FX is sticky. 


FEX and VX are simply the ORs of other FPSCR bits. 
Therefore these two bits are not listed among the 
FPSCR bits affected by the various instructions. 


FPSCR 
0 31 


Figure 27. Floating-Point Status and Control Register 


The bit definitions for the FPSCR are as follows. 
Bit(s) Description 


0 Floating-Point Exception Summary (FX) 
Every floating-point instruction, except mtfsfi 
and mtfsf, implicitly sets FPSCRey to 1 if that 
instruction causes any of the floating-point 
exception bits in the FPSCR to change from 0 to 
1. merfs, mtfsfi, mtfsf, mtfsb0, and mtfsb1 can 
alter FPSCRey explicitly. 


1 Floating-Point Enabled Exception Summary 
(FEX) 
This bit is the OR of all the floating-point excep- 
tion bits masked by their respective enable bits. 
merfs, mtfsfi, mtfsf, mtfsb0, and mtfsb1 cannot 
alter FPSCRegy explicitly. 


2 Floating-Point Invalid Operation 
Summary (VX) 
This bit is the OR of all the Invalid Operation 
exception bits. merfs, mtfsfi, mtfsf, mtfsb0, and 
mtfsb1 cannot alter FPSCRyx explicitly. 


Exception 


3 Floating-Point Overflow Exception (OX) 
See Section 4.4.3, “Overflow Exception” on 
page 93. 


4 Floating-Point Underflow Exception (UX) 
See Section 4.4.4, “Underflow Exception” on 
page 93. 


5 Floating-Point Zero Divide Exception (ZX) 
See Section 4.4.2, “Zero Divide Exception” on 
page 92. 


6 Floating-Point Inexact Exception (XX) 
See Section 4.4.5, “Inexact Exception” on 
page 94. 


FPSCRxx is a sticky version of FPSCRg, (see 
below). Thus the following rules completely 
describe how FPSCRxx is set by a given instruc- 
tion. 


m |f the instruction affects FPSCRg, the new 
value of FPSCRxx is obtained by ORing the 
old value of FPSCRxx with the new value of 
FPSCRg. 

m |f the instruction does not affect FPSCRg, 
the value of FPSCRxx is unchanged. 


7 Floating-Point Invalid Operation 
(SNaN) (VXSNAN) 
See Section 4.4.1, “Invalid Operation Exception” 
on page 91. 


Exception 


8 Floating-Point Invalid Operation 
(oo — oo ) (VXISI) 
See Section 4.4.1, "Invalid Operation Exception" 
on page 91. 


Exception 


9 Floating-Point Invalid Operation 
(oo = co ) (VXIDI) 
See Section 4.4.1, “Invalid Operation Exception” 
on page 91. 


Exception 
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10 


11 


12 


13 


14 


Floating-Point 
(0= 0) (VXZDZ) 
See Section 4.4.1, “Invalid Operation Exception” 
on page 91. 


Invalid Operation Exception 


Floating-Point 
(œ x0) (VXIMZ) 
See Section 4.4.1, “Invalid Operation Exception” 
on page 91. 


Invalid Operation Exception 


Floating-Point Invalid Operation 
(Invalid Compare) (VXVC) 

See Section 4.4.1, “Invalid Operation Exception” 
on page 91. 


Exception 


Floating-Point Fraction Rounded (FR) 

The last Arithmetic or Rounding and Conversion 
instruction incremented the fraction during 
rounding. See Section 4.3.6, “Rounding” on 
page 88. This bit is not sticky. 


Floating-Point Fraction Inexact (Fl) 

The last Arithmetic or Rounding and Conversion 
instruction either produced an inexact result 
during rounding or caused a disabled Overflow 
Exception. See Section 4.3.6, “Rounding” on 
page 88. This bit is not sticky. 


See the definition of FPSCRyx, above, regarding 
the relationship between FPSCRr, and FPSCRxx. 


15:19 Floating-Point Result Flags (FPRF) 


15 


This field is set as described below. For arith- 
metic, rounding, and conversion instructions, 
the field is set based on the result placed into 
the target register, except that if any portion of 
the result is undefined then the value placed 
into FPRF is undefined. 


Floating-Point Result Class Descriptor (C) 
Arithmetic, rounding, and conversion 
instructions may set this bit with the FPCC bits, 
to indicate the class of the result as shown in 
Figure 28 on page 85. 


16:19 Floating-Point Condition Code (FPCC) 


16 
17 


18 
19 
20 


Floating-point Compare instructions set one of 
the FPCC bits to 1 and the other three FPCC 
bits to O. Arithmetic, rounding, and conversion 
instructions may set the FPCC bits with the C 
bit, to indicate the class of the result as shown 
in Figure 28 on page 85. Note that in this case 
the high-order three bits of the FPCC retain 
their relational significance indicating that the 
value is less than, greater than, or equal to 
zero. 


Floating-Point Less Than or Negative (FL or <) 


Floating-Point Greater Than or Positive (FG or 
>) 


Floating-Point Equal or Zero (FE or =) 
Floating-Point Unordered or NaN (FU or ?) 


Reserved 


21 


22 


23 


24 


25 


26 


27 


28 


29 


Floating-Point Invalid Operation 
(Software Request) (VXSOFT) 
This bit can be altered only by merfs, mtfsfi, 
mtfsf, mtfsb0, or mtfsb1. See Section 4.4.1, 
“Invalid Operation Exception” on page 91. 


Exception 


Floating-Point Invalid Operation 
(Invalid Square Root) (VXSQRT) 
See Section 4.4.1, “Invalid Operation Exception” 
on page 91. 


Exception 


r— Architecture Note 


This bit is defined even for implementations 
that do not support either of the two 
optional instructions that set it, namely 
Floating Square Root and Floating Recip- 
rocal Square Root Estimate. Defining it for 
all implementations gives software a 
standard interface for handling square root 
exceptions. 


r— Programming Note 


If the implementation does not support the 
optional Floating Square Root or Floating 
Reciprocal Square Root Estimate instruction, 
software can simulate the instruction and 
set this bit to reflect the exception. 


Floating-Point Invalid Operation 
(Invalid Integer Convert) (VXCVI) 
See Section 4.4.1, “Invalid Operation Exception” 
on page 91. 


Exception 


Floating-Point 
Enable (VE) 
See Section 4.4.1, “Invalid Operation Exception” 
on page 91. 


Invalid Operation Exception 


Floating-Point Overflow Exception Enable (OE) 
See Section 4.4.3, “Overflow Exception” on 
page 93. 


Floating-Point Underflow Exception Enable (UE) 
See Section 4.4.4, “Underflow Exception” on 
page 93. 


Floating-Point Zero Divide Exception Enable 
(ZE) 

See Section 4.4.2, “Zero Divide Exception” on 
page 92. 


Floating-Point Inexact Exception Enable (XE) 
See Section 4.4.5, “Inexact Exception” on 
page 94. 


Reserved 


Architecture Note 


This bit will be among the last to be 
assigned a meaning. It was the NI 


(Non-IEEE Mode) bit in earlier versions of 
the architecture. 
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30:31 Floating-Point Rounding Control (RN) 
See Section 4.3.6, “Rounding” on page 88. 


00 Round to Nearest 

01 Round toward Zero 

10 Round toward + Infinity 
11 Round toward - Infinity 


Result 
Flags 


Result Value Class 


Oo 
^ 
v 
rm 
~ 


Quiet NaN 

Infinity 

Normalized Number 
Denormalized Number 
Zero 

Zero 


Denormalized Number 
Normalized Number 
Infinity 


044044400 
9000044440 
2000000 
o0oo0o0-0000 
000000 + 


Figure 28. Floating-Point Result Flags 


4.3 Floating-Point Data 


4.3.1 Data Format 


This architecture defines the representation of a float- 
ing-point value in two different binary fixed-length 
formats. The format may be a 32-bit single format for 
a single-precision value or a 64-bit double format for 
a double-precision value. The single format may be 
used for data in storage. The double format format 
may be used for data in storage and for data in float- 
ing-point registers. 


The lengths of the exponent and the fraction fields 
differ between these two formats. The structure of 
the single and double formats is shown below. 


01 9 31 


Figure 29. Floating-point single format 


S EXP FRACTION 
01 12 63 


Figure 30. Floating-point double format 


Values in floating-point format are composed of three 
fields: 


S sign bit 
EXP exponent+bias 
FRACTION fraction 


Representation of numeric values in the floating-point 
formats consists of a sign bit (S), a biased exponent 
(EXP), and the fraction portion (FRACTION) of the 
significand. The significand consists of a leading 
implied bit concatenated on the right with the FRAC- 
TION. This leading implied bit is 1 for normalized 
numbers and O for denormalized numbers and is 
located in the unit bit position (i.e., the first bit to the 
left of the binary point). Values representable within 
the two floating-point formats can be specified by the 
parameters listed in Figure 31. 


mg | mone | 


Exponent Bias 
Maximum Exponent 
Minimum Exponent 


Widths (bits) 
Format 
Sign 
Exponent 
Fraction 
Significand 


Figure 31. IEEE floating-point fields 


The architecture requires that the FPRs of the 
Floating-Point Processor support the floating-point 
double format only. 


4.3.2 Value Representation 


This architecture defines numeric and non-numeric 
values representable within each of the two supported 
formats. The numeric values are approximations to 
the real numbers and include the normalized 
numbers, denormalized numbers, and zero values. 
The non-numeric values representable are the infin- 
ities and the Not a Numbers (NaNs). The infinities are 
adjoined to the real numbers, but are not numbers 
themselves, and the standard rules of arithmetic do 
not hold when they are used in an operation. They 
are related to the real numbers by order alone. It is 
possible however to define restricted operations 
among numbers and infinities as defined below. The 
relative location on the real number line for each of 
the defined entities is shown in Figure 32. 


-INF | -NOR [-DEN |-0 |+0 |+DEN | +NOR |tINE 
«— | at | > 


Figure 32. Approximation to real numbers 


The NaNs are not related to the numeric values or 
infinities by order or value but are encodings used to 
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convey diagnostic information such as the represen- 
tation of uninitialized variables. 


The following is a description of the different floating- 
point values defined in the architecture: 


Binary floating-point numbers 

Machine representable values used as approxi- 
mations to real numbers. Three categories of 
numbers are supported: normalized numbers, denor- 
malized numbers, and zero values. 


Normalized numbers (+ NOR) 
These are values that have a biased exponent value 
in the range: 


1 to 254 in single format 
1 to 2046 in double format 


They are values in which the implied unit bit is 1. 
Normalized numbers are interpreted as follows: 


NOR = (—1)§ x 2E x (1.fraction) 


where s is the sign, Eis the unbiased exponent, and 
1.fraction is the significand, which is composed of a 
leading unit bit (implied bit) and a fraction part. 


The ranges covered by the magnitude (M) of a nor- 
malized floating-point number are approximately 
equal to: 


Single Format: 
1.2x10738 < M < 3.4x1038 


Double Format: 
2.2x10-308 < M < 1.8x10308 


Zero values (+ 0) 

These are values that have a biased exponent value 
of zero and a fraction value of zero. Zeros can have 
a positive or negative sign. The sign of zero is 
ignored by comparison operations (i.e., comparison 
regards +0 as equal to - 0). 


Denormalized numbers (+ DEN) 

These are values that have a biased exponent value 
of zero and a nonzero fraction value. They are 
nonzero numbers smaller in magnitude than the 
representable normalized numbers. They are values 
in which the implied unit bit is 0. Denormalized 
numbers are interpreted as follows: 


DEN = (-1)S x 25min x (0.fraction) 


where Emin is the minimum representable exponent 
value (- 126 for single-precision, — 1022 for double- 
precision). 


Infinities (+ oo) 
These are values that have the maximum biased 
exponent value: 


255 in single format 
2047 in double format 


and a zero fraction value. They are used to approxi- 
mate values greater in magnitude than the maximum 
normalized value. 


Infinity arithmetic is defined as the limiting case of 
real arithmetic, with restricted operations defined 
among numbers and infinities. Infinities and the real 
numbers can be related by ordering in the affine 
sense: 


— co < every finite number < + oo 


Arithmetic on infinities is always exact and does not 
signal any exception, except when an exception 
occurs due to the invalid operations as described in 
Section 4.4.1, “Invalid Operation Exception” on 
page 91. 


Not a Numbers (NaNs) 

These are values that have the maximum biased 
exponent value and a nonzero fraction value. The 
sign bit is ignored (i.e., NaNs are neither positive nor 
negative). If the high-order bit of the fraction field is 
O then the NaN is a Signaling NaN; otherwise it is a 
Quiet NaN. 


Signaling NaNs are used to signal exceptions when 
they appear as operands of computational 
instructions. 


Quiet NaNs are used to represent the results of 
certain invalid operations, such as invalid arithmetic 
operations on infinities or on NaNs, when Invalid 
Operation Exception is disabled (FPSCRys=0). Quiet 
NaNs propagate through all floating-point operations 
except ordered comparison, Floating Round to Single- 
Precision, and conversion to integer. Quiet NaNs do 
not signal exceptions, except for ordered comparison 
and conversion to integer operations. Specific 
encodings in QNaNs can thus be preserved through a 
sequence of floating-point operations, and used to 
convey diagnostic information to help identify results 
from invalid operations. 


When a QNaN is the result of a floating-point opera- 
tion because one of the operands is a NaN or because 
a QNaN was generated due to a disabled Invalid 
Operation Exception, then the following rule is applied 
to determine the NaN with the high-order fraction bit 
set to 1 that is to be stored as the result. 
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if (FRA) is a NaN 
then FRT € (FRA) 
else if (FRB) is a NaN 
then if instruction is frsp 
then FRT € (FRB)o-34 | 20 
else FRT € (FRB) 
else if (FRC) is a NaN 
then FRT € (FRC) 
else if generated QNaN 
then FRT € generated QNaN 


If the operand specified by FRA is a NaN, then that 
NaN is stored as the result. Otherwise, if the operand 
specified by FRB is a NaN (if the instruction specifies 
an FRB operand), then that NaN is stored as the 
result, with the low-order 29 bits of the result set to 0 
if the instruction is frsp. Otherwise, if the operand 
specified by FRC is a NaN (if the instruction specifies 
an FRC operand), then that NaN is stored as the 
result. Otherwise, if a QNaN was generated due to a 
disabled Invalid Operation Exception, then that QNaN 
is stored as the result. If a QNaN is to be generated 
as a result, then the QNaN generated has a sign bit of 
0, an exponent field of all 1s, and a high-order fraction 
bit of 1 with all other fraction bits 0. Any instruction 
that generates a QNaN as the result of a disabled 
Invalid Operation Exception generates this QNaN (i.e., 
Ox7FF8 0000 0000 0000). 


A double-precision NaN is considered to be represent- 
able in single format if and only if the low-order 29 
bits of the double-precision NaN's fraction are zero. 


4.3.3 Sign of Result 


The following rules govern the sign of the result of an 
arithmetic, rounding, or conversion operation, when 
the operation does not yield an exception. They apply 
even when the operands or results are zeros or infin- 
ities. 

m The sign of the result of an add operation is the 

sign of the operand having the larger absolute 
value. If both operands have the same sign, the 
sign of the result of an add operation is the same 
as the sign of the operands. The sign of the 
result of the subtract operation x-y is the same 
as the sign of the result of the add operation 
x+(—y). 
When the sum of two operands with opposite 
sign, or the difference of two operands with the 
same sign, is exactly zero, the sign of the result 
is positive in all rounding modes except Round 
toward — Infinity, in which mode the sign is nega- 
tive. 


m The sign of the result of a multiply or divide oper- 
ation is the Exclusive OR of the signs of the oper- 
ands. 


m The sign of the result of a Square Root or Recip- 
rocal Square Root Estimate operation is always 


positive, except that the square root of -0 is —0 
and the reciprocal square root of — O is — Infinity. 


m The sign of the result of a Round to Single- 
Precision or Convert To/From Integer operation is 
the sign of the operand being converted. 


For the Multiply-Add instructions, the rules given 
above are applied first to the multiply operation and 
then to the add or subtract operation (one of the 
inputs to the add or subtract operation is the result of 
the multiply operation). 


4.3.4 Normalization and 
Denormalization 


The intermediate result of an arithmetic or frsp 
instruction may require normalization and/or denor- 
malization as described below. Normalization and 
denormalization do not affect the sign of the result. 


When an arithmetic or frsp instruction produces an 
intermediate result, consisting of a sign bit, an expo- 
nent, and a nonzero significand with a 0 leading bit, it 
is not a normalized number and must be normalized 
before it is stored. 


A number is normalized by shifting its significand left 
while decrementing its exponent by 1 for each bit 
shifted, until the leading significand bit becomes 1. 
The Guard bit and the Round bit (see Section 4.5.1, 
“Execution Model for IEEE Operations” on page 95) 
participate in the shift with zeros shifted into the 
Round bit. The exponent is regarded as if its range 
were unlimited. 


After normalization, or if normalization was not 
required, the intermediate result may have a nonzero 
significand and an exponent value that is less than 
the minimum value that can be represented in the 
format specified for the result. In this case, the inter- 
mediate result is said to be “Tiny” and the stored 
result is determined by the rules described in Section 
4.4.4, “Underflow Exception” on page 93. These rules 
may require denormalization. 


A number is denormalized by shifting its significand 
right while incrementing its exponent by 1 for each bit 
shifted, until the exponent is equal to the format's 
minimum value. If any significant bits are lost in this 
shifting process then “Loss of Accuracy” has occurred 
(See Section 4.4.4, “Underflow Exception” on 
page 93) and Underflow Exception is signaled. 


Engineering Note 


When denormalized numbers are operands of 
multiply, divide, and square root operations, some 
implementations may prenormalize the operands 
internally before performing the operations. 
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4.3.5 Data Handling and Precision 


Instructions are defined to move floating-point data 
between the FPRs and storage. For double format 
data, the data are not altered during the move. For 
single format data, a format conversion from single to 
double is performed when loading from storage into 
an FPR and a format conversion from double to single 
is performed when storing from an FPR to storage. 
No floating-point exceptions are caused by these 
instructions. 


All computational, Move, and Select instructions use 
the floating-point double format. 


Floating-point single-precision is obtained with the 
implementation of four types of instruction. 


1. Load Floating-Point Single 


This form of instruction accesses a single- 
precision operand in single format in storage, 
converts it to double format, and loads it into an 
FPR. No floating-point exceptions are caused by 
these instructions. 


2. Round to Floating-Point Single-Precision 


The Floating Round to Single-Precision instruction 
rounds a double-precision operand to single- 
precision, checking the exponent for single- 
precision range and handling any exceptions 
according to respective enable bits, and places 
that operand into an FPR as a double-precision 
operand. For results produced by single- 
precision arithmetic instructions, single-precision 
loads, and other instances of the Floating Round 
to Single-Precision instruction, this operation 
does not alter the value. 


3. Single-Precision Arithmetic Instructions 


This form of instruction takes operands from the 
FPRs in double format, performs the operation as 
if it produced an intermediate result having infi- 
nite precision and unbounded exponent range, 
and then coerces this intermediate result to fit in 
single format. Status bits, in the FPSCR and 
optionally in the Condition Register, are set to 
reflect the single-precision result. The result is 
then converted to double format and placed into 
an FPR. The result lies in the range supported by 
the single format. 


All input values must be representable in single 
format; if they are not, the result placed into the 
target FPR, and the setting of status bits in the 
FPSCR and in the Condition Register (if Rc=1), 
are undefined. 


4. Store Floating-Point Single 


This form of instruction converts a double- 
precision operand to single format and stores 
that operand into storage. No floating-point 
exceptions are caused by these instructions. 


(The value being stored is effectively assumed to 
be the result of an instruction of one of the pre- 
ceding three types.) 


When the result of a Load Floating-Point Single, 
Floating Round to Single-Precision, or single-precision 
arithmetic instruction is stored in an FPR, the low- 
order 29 FRACTION bits are zero. 


Programming Note 


The Floating Round to Single-Precision instruction 
is provided to allow value conversion from 
double-precision to single-precision with appro- 
priate exception checking and rounding. This 
instruction should be used to convert double- 
precision floating-point values (produced by 
double-precision load and arithmetic instructions 
and by fcfid) to single-precision values prior to 
storing them into single format storage elements 
or using them as operands for single-precision 
arithmetic instructions. Values produced by 
single-precision load and arithmetic instructions 
are already single-precision values and can be 
stored directly into single format storage ele- 
ments, or used directly as operands for single- 
precision arithmetic instructions, without 


preceding the store, or the arithmetic instruction, 
by a Floating Round to Single-Precision instruc- 
tion. 


Programming Note 


A single-precision value can be used in double- 
precision arithmetic operations. The reverse is 
true only if the double-precision value is repre- 
sentable in single format. 


Some implementations may execute single- 
precision arithmetic instructions faster than 
double-precision arithmetic instructions. There- 
fore, if double-precision accuracy is not required, 
single-precision data and instructions should be 
used. 


4.3.6 Rounding 


The material in this section applies to operations that 
have numeric operands (i.e., operands that are not 
infinities or NaNs). Rounding the intermediate result 
of such an operation may cause an Overflow Excep- 
tion, an Underflow Exception, or an Inexact Exception. 
The remainder of this section assumes that the opera- 
tion causes no exceptions and that the result is 
numeric. See Section 4.3.2, “Value Representation” 
on page 85 and Section 4.4, “Floating-Point 
Exceptions” on page 89 for the cases not covered 
here. 


The arithmetic, rounding, and conversion instructions 
produce an intermediate result that can be regarded 
as having infinite precision and unbounded exponent 
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range. This intermediate result is normalized or 
denormalized if required, then rounded to the destina- 
tion format. The final result is then placed into the 
target FPR in double format or in fixed-point integer 
format, depending on the instruction. 


The instructions that round their intermediate result 
are the Arithmetic and Rounding and Conversion 
instructions. Each of these instructions sets FPSCR 
bits FR and FI. If the fraction was incremented during 
rounding then FR is set to 1, otherwise FR is set to 0. 
If the rounded result is inexact then Fl is set to 1, oth- 
erwise FI is set to 0. 


The two Estimate instructions set FR and Fl to unde- 
fined values. The remaining floating-point instructions 
do not alter FR and FI. 


Four user-selectable rounding modes are provided 
through the Floating-Point Rounding Control field in 
the FPSCR. See Section 4.2.2, “Floating-Point Status 
and Control Register” on page 83. These are 
encoded as follows: 


RN Rounding Mode 

00 Round to Nearest 

01 Round toward Zero 

10 Round toward + Infinity 
11 Round toward - Infinity 


Let Z be the intermediate arithmetic result or the 
operand of a convert operation. If Z can be repres- 
ented exactly in the target format, then the result in 
all rounding modes is Z as represented in the target 
format. If Z cannot be represented exactly in the 
target format, let Z1 and Z2 bound Z as the next 
larger and next smaller numbers representable in the 
target format. Then Z1 or Z2 can be used to approxi- 
mate the result in the target format. 


Figure 33 shows the relation of Z, Z1, and Z2 in this 
case. The following rules specify the rounding in the 
four modes. "LSB" means “least significant bit". 


By Incrementing LSB of Z 
Infinitely Precise Value 
[— By Truncating after LSB — 


22| 21 0 22| 71 
Z Z 
Negative values 4— > Positive values 


Figure 33. Selection of Z1 and Z2 


Round to Nearest 
Choose the value that is closer to Z (Z1 or Z2). 
In case of a tie, choose the one that is even 
(least significant bit 0). 


Round toward Zero 
Choose the smaller in magnitude (Z1 or Z2). 


Round toward + Infinity 
Choose Z1. 


Round toward - Infinity 
Choose Z2. 


See Section 4.5.1, “Execution Model for IEEE 
Operations” on page 95 for a detailed explanation of 
rounding. 


4.4 Floating-Point Exceptions 


This architecture defines the following floating-point 
exceptions: 


m Invalid Operation Exception 
SNaN 
Infinity— Infinity 
Infinity = Infinity 
Zero- Zero 
InfinityxZero 
Invalid Compare 
Software Request 
Invalid Square Root 
Invalid Integer Convert 

m Zero Divide Exception 

m Overflow Exception 

m Underflow Exception 

m Inexact Exception 


These exceptions may occur during execution of com- 
putational instructions. In addition, an Invalid Opera- 
tion Exception occurs when a Move To FPSCR 
instruction sets FPSCRyxsorr to 1 (Software Request). 


Each floating-point exception, and each category of 
Invalid Operation Exception, has an exception bit in 
the FPSCR. In addition, each floating-point exception 
has a corresponding enable bit in the FPSCR. The 
exception bit indicates occurrence of the corre- 
sponding exception. If an exception occurs, the corre- 
sponding enable bit governs the result produced by 
the instruction and, in conjunction with the FEO and 
FE1 bits (see page 90), whether and how the system 
floating-point enabled exception error handler is 
invoked. (In general, the enabling specified by the 
enable bit is of invoking the system error handler, not 
of permitting the exception to occur. The occurrence 
of an exception depends only on the instruction and 
its inputs, not on the setting of any control bits. The 
only deviation from this general rule is that the occur- 
rence of an Underflow Exception may depend on the 
setting of the enable bit.) 


A single instruction, other than mtfsfi or mtfsf, may 
set more than one exception bit only in the following 
cases: 


m Inexact Exception may be set with Overflow 
Exception. 

m Inexact Exception may be set with Underflow 
Exception. 

m Invalid Operation Exception (SNaN) is set with 
Invalid Operation Exception (co x0) for 
Multiply-Add instructions for which the values 
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being multiplied are infinity and zero and the 
value being added is an SNaN. 

m Invalid Operation Exception (SNaN) may be set 
with Invalid Operation Exception (Invalid 
Compare) for Compare Ordered instructions. 

m Invalid Operation Exception (SNaN) may be set 
with Invalid Operation Exception (Invalid Integer 
Convert) for Convert To Integer instructions. 


When an exception occurs the instruction execution 
may be suppressed or a result may be delivered, 
depending on the exception. 


Instruction execution is suppressed for the following 
kinds of exception, so that there is no possibility that 
one of the operands is lost: 


m Enabled Invalid Operation 
m Enabled Zero Divide 


For the remaining kinds of exception, a result is gen- 
erated and written to the destination specified by the 
instruction causing the exception. The result may be 
a different value for the enabled and disabled condi- 
tions for some of these exceptions. The kinds of 
exception that deliver a result are the following: 


Disabled Invalid Operation 
Disabled Zero Divide 
Disabled Overflow 
Disabled Underflow 
Disabled Inexact 

Enabled Overflow 

Enabled Underflow 
Enabled Inexact 


Subsequent sections define each of the floating-point 
exceptions and specify the action that is taken when 
they are detected. 


The IEEE standard specifies the handling of excep- 
tional conditions in terms of “traps” and “trap han- 
dlers”. In this architecture, an FPSCR exception 
enable bit of 1 causes generation of the result value 
specified in the IEEE standard for the “trap enabled” 
case; the expectation is that the exception will be 
detected by software, which will revise the result. An 
FPSCR exception enable bit of 0 causes generation of 
the “default result” value specified for the “trap disa- 
bled” (or “no trap occurs” or “trap is not imple- 
mented”) case; the expectation is that the exception 
will not be detected by software, which will simply use 
the default result. The result to be delivered in each 
case for each exception is described in the sections 
below. 


The IEEE default behavior when an exception occurs 
is to generate a default value and not to notify soft- 
ware. In this architecture, if the IEEE default behavior 
when an exception occurs is desired for all 
exceptions, all FPSCR exception enable bits should be 
set to O and Ignore Exceptions Mode (see below) 
should be used. In this case the system floating-point 
enabled exception error handler is not invoked, even 


if floating-point exceptions occur: software can inspect 
the FPSCR exception bits if necessary, to determine 
whether exceptions have occurred. 


In this architecture, if software is to be notified that a 
given kind of exception has occurred, the corre- 
sponding FPSCR exception enable bit must be set to 1 
and a mode other than Ignore Exceptions Mode must 
be used. In this case the system floating-point 
enabled exception error handler is invoked if an 
enabled floating-point exception occurs. The system 
floating-point enabled exception error handler is also 
invoked if a Move To FPSCR instruction causes an 
exception bit and the corresponding enable bit both to 
be 1; the Move To FPSCR instruction is considered to 
cause the enabled exception. 


The FEO and FE1 bits control whether and how the 
system floating-point enabled exception error handler 
is invoked if an enabled floating-point exception 
occurs. The location of these bits and the require- 
ments for altering them are described in Book III, 
PowerPC AS Operating Environment Architecture. 
(The system floating-point enabled exception error 
handler is never invoked because of a disabled float- 
ing-point exception.) The effects of the four possible 
settings of these bits are as follows. 


FEO FE1 Description 


O O Ignore Exceptions Mode 
Floating-point exceptions do not cause the 
system floating-point enabled exception 
error handler to be invoked. 


O 1 Imprecise Nonrecoverable Mode 

The system floating-point enabled exception 
error handler is invoked at some point at or 
beyond the instruction that caused the 
enabled exception. It may not be possible to 
identify the excepting instruction or the data 
that caused the exception. Results 
produced by the excepting instruction may 
have been used by or may have affected 
subsequent instructions that are executed 
before the error handler is invoked. 


1 O Imprecise Recoverable Mode 

The system floating-point enabled exception 
error handler is invoked at some point at or 
beyond the instruction that caused the 
enabled exception. Sufficient information is 
provided to the error handler that it can 
identify the excepting instruction and the 
operands, and correct the result. No results 
produced by the excepting instruction have 
been used by or have affected subsequent 
instructions that are executed before the 
error handler is invoked. 


1 1 Precise Mode 
The system floating-point enabled exception 
error handler is invoked precisely at the 
instruction that caused the enabled excep- 
tion. 
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Architecture Note 


The FEO and FE1 bits must be defined in Book III 
in a manner such that they can be changed 
dynamically and can easily be treated as part of a 
process' state. 


In all cases, the question of whether a floating-point 
result is stored, and what value is stored, is governed 
by the FPSCR exception enable bits, as described in 
subsequent sections, and is not affected by the value 
of the FEO and FE1 bits. 


In all cases in which the system floating-point enabled 
exception error handler is invoked, all instructions 
before the instruction at which the system floating- 
point enabled exception error handler is invoked have 
completed, and no instruction after the instruction at 
which the system floating-point enabled exception 
error handler is invoked has begun execution. (Recall 
that, for the two Imprecise modes, the instruction at 
which the system floating-point enabled exception 
error handler is invoked need not be the instruction 
that caused the exception.) The instruction at which 
the system floating-point enabled exception error 
handler is invoked has not been executed unless it is 
the excepting instruction, in which case it has been 
executed if the exception is not among those listed on 
page 90 as suppressed. 


Programming Note 


In any of the three non-Precise modes, a Floating- 
Point Status and Control Register instruction can 
be used to force any exceptions, due to 
instructions initiated before the Floating-Point 
Status and Control Register instruction, to be 
recorded in the FPSCR. (This forcing is super- 
fluous for Precise Mode.) 


In either of the Imprecise modes, a Floating-Point 
Status and Control Register instruction can be 
used to force any invocations of the system float- 
ing-point enabled exception error handler, due to 
instructions initiated before the Floating-Point 
Status and Control Register instruction, to occur. 
(This forcing has no effect in Ignore Exceptions 
Mode, and is superfluous for Precise Mode.) 


In order to obtain the best performance across the 
widest range of implementations, the programmer 
should obey the following guidelines. 


m |f the IEEE default results are acceptable to the 
application, Ignore Exceptions Mode should be 
used with all FPSCR exception enable bits set to 
0. 


m lí the IEEE default results are not acceptable to 
the application, Imprecise Nonrecoverable Mode 


should be used, or Imprecise Recoverable Mode 
if recoverability is needed, with FPSCR exception 
enable bits set to 1 for those exceptions for which 
the system floating-point enabled exception error 
handler is to be invoked. 


m Ignore Exceptions Mode should not, in general, be 
used when any FPSCR exception enable bits are 
set to 1. 


m Precise Mode may degrade performance in some 
implementations, perhaps substantially, and 
therefore should be used only for debugging and 
other specialized applications. 


Engineering Note 


It is permissible for the implementation to be 
precise in any of the three modes that permit 
interrupts, or to be recoverable in Nonrecoverable 
Mode. 


4.4.1 Invalid Operation Exception 


4.4.1.1 Definition 


An Invalid Operation Exception occurs when an 
operand is invalid for the specified operation. The 
invalid operations are: 


m Any floating-point operation on a Signaling NaN 
(SNaN) 

m For add or subtract operations, magnitude sub- 

traction of infinities (co — ee ) 

Division of infinity by infinity (co = ee ) 

Division of zero by zero (0+ 0) 

Multiplication of infinity by zero (co x0) 

Ordered comparison involving a NaN (Invalid 

Compare) 

= Square root or reciprocal square root of a nega- 
tive (and nonzero) number (Invalid Square Root) 

m Integer convert involving a number too large in 
magnitude to be represented in the target format, 
or involving an infinity or a NaN (Invalid Integer 
Convert) 


In addition, an Invalid Operation Exception occurs if 
software explicitly requests this by executing an 
mtfsfi, mtfsf, or mtfsb1 instruction that sets 
FPSCRyxsorr to 1 (Software Request). 


Programming Note 


The purpose of FPSCRyxsorr is to allow software 
to cause an Invalid Operation Exception for a con- 
dition that is not necessarily associated with the 
execution of a floating-point instruction. For 
example, it might be set by a program that com- 
putes a square root, if the source operand is neg- 
ative. 
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4.4.1.2 Action 


The action to be taken depends on the setting of the 
Invalid Operation Exception Enable bit of the FPSCR. 


When Invalid Operation Exception is enabled 
(FPSCRyg=1) and Invalid Operation occurs or soft- 
ware explicitly requests the exception, the following 
actions are taken: 


1. One or two Invalid Operation Exceptions are set 


FPSCRyxsNAN (if SNaN) 
FPSCRyxisi (if o — 00 ) 
FPSCRyxip1 (if co +00 ) 
FPSCRyxzpz (if 0+ 0) 
FPSCRyximz (if co x0) 
FPSCRyxvc (if invalid comp) 
FPSCRyxsorT (if software reg) 
FPSCRyxsarT (if invalid sqrt) 
FPSCRyxcv1 (if invalid int cvrt) 


2. If the operation is an arithmetic, Floating Round 
to Single-Precision, or convert to integer opera- 
tion, 

the target FPR is unchanged 
FPSCRra F are set to zero 
FPSCRrppF is unchanged 
3. If the operation is a compare, 
FPSCRr ri c are unchanged 
FPSCRepcc is set to reflect unordered 
4. If software explicitly requests the exception, 
FPSCRra fı cpap are as set by the mtfsfi, 
mtfsf, or mtfsb1 instruction 


When Invalid Operation Exception is disabled 
(FPSCRye=0) and Invalid Operation occurs or soft- 
ware explicitly requests the exception, the following 
actions are taken: 


1. One or two Invalid Operation Exceptions are set 


FPSCRyxsNAN (if SNaN) 
FPSCRyxisi (if 09; ="00: ) 
FPSCRyxipi (if co +00 ) 
FPSCRyxzpz (if 0+ 0) 
FPSCRyximz (if co x0) 
FPSCRyxvc (if invalid comp) 
FPSCRyxsorT (if software reg) 
FPSCRyxsarT (if invalid sqrt) 
FPSCRyxcv1 (if invalid int cvrt) 


2. If the operation is an arithmetic or Floating 
Round to Single-Precision operation, 
the target FPR is set to a Quiet NaN 
FPSCRep F are set to zero 
FPSCRrppF is set to indicate the class of the 
result (Quiet NaN) 
3. If the operation is a convert to 64-bit integer 
operation, 
the target FPR is set as follows: 
FRT is set to the most positive 64-bit 
integer if the operand in FRB is a posi- 
tive number or +œ, and to the most 
negative 64-bit integer if the operand in 
FRB is a negative number, — ee , or NaN 
FPSCRep F are set to zero 


FPSCRrppr¢ is undefined 
4. If the operation is a convert to 32-bit integer 
operation, 
the target FPR is set as follows: 
FRT 9.31 € undefined 
FRT30.63 are set to the most positive 
32-bit integer if the operand in FRB is a 
positive number or + o, and to the most 
negative 32-bit integer if the operand in 
FRB is a negative number, — oo, or NaN 
FPSCRep F are set to zero 
FPSCRrppF is undefined 
5. If the operation is a compare, 
FPSCRerp ri c are unchanged 
FPSCRepcc is set to reflect unordered 
6. If software explicitly requests the exception, 
FPSCRra fı cpap are as set by the mtfsfi, 
mtfsf, or mtfsb1 instruction 


4.4.2 Zero Divide Exception 
4.4.2.1 Definition 


A Zero Divide Exception occurs when a Divide instruc- 
tion is executed with a zero divisor value and a finite 
nonzero dividend value. It also occurs when a Recip- 
rocal Estimate instruction (fres or frsqrte) is executed 
with an operand value of zero. 


Architecture Note 


The name is a misnomer used for historical 
reasons. The proper name for this exception 


should be “Exact Infinite Result from Finite Oper- 
ands" corresponding to what mathematicians call 
a "pole". 


4.4.2.2 Action 


The action to be taken depends on the setting of the 
Zero Divide Exception Enable bit of the FPSCR. 


When Zero Divide Exception is enabled (FPSCRzgz 1) 
and Zero Divide occurs, the following actions are 
taken: 


1. Zero Divide Exception is set 
FPSCRzx € 1 

2. The target FPR is unchanged 

3. FPSCRep pı are set to zero 

4. FPSCRrprr is unchanged 


When Zero Divide Exception is disabled (FPSCRzg20) 
and Zero Divide occurs, the following actions are 
taken: 


1. Zero Divide Exception is set 
FPSCRzx € 1 
2. The target FPR is set to x Infinity, where the sign 
is determined by the XOR of the signs of the 
operands 
3. FPSCRra pı are set to zero 
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4. FPSCRrppr is set to indicate the class and sign of 
the result (+ Infinity) 


4.4.3 Overflow Exception 
4.4.3.1 Definition 


Overflow occurs when the magnitude of what would 
have been the rounded result if the exponent range 
were unbounded exceeds that of the largest finite 
number of the specified result precision. 


4.4.3.2 Action 


The action to be taken depends on the setting of the 
Overflow Exception Enable bit of the FPSCR. 


When Overflow Exception is enabled (FPSCRog- 1) 
and exponent overflow occurs, the following actions 
are taken: 


1. Overflow Exception is set 
FPSCRoy € 1 

2. For double-precision arithmetic instructions, the 
exponent of the normalized intermediate result is 
adjusted by subtracting 1536 

3. For single-precision arithmetic instructions and 
the Floating Round to Single-Precision instruc- 
tion, the exponent of the normalized intermediate 
result is adjusted by subtracting 192 

4. The adjusted rounded result is placed into the 
target FPR 

5. FPSCRrppF is set to indicate the class and sign of 
the result (x Normal Number) 


When Overflow Exception is disabled (FPSCRog-0) 
and overflow occurs, the following actions are taken: 


1. Overflow Exception is set 
FPSCRoy € 1 
2. Inexact Exception is set 
FPSCRxx € 1 
3. The result is determined by the rounding mode 
(FPSCRgy) and the sign of the intermediate result 
as follows: 
A. Round to Nearest 
Store + Infinity, where the sign is the sign of 
the intermediate result 
B. Round toward Zero 
Store the format's largest finite number with 
the sign of the intermediate result 
C. Round toward + Infinity 
For negative overflow, store the format's 
most negative finite number; for positive 
overflow, store + Infinity 
D. Round toward - Infinity 
For negative overflow, store — Infinity; for 
positive overflow, store the format's largest 
finite number 
4. The result is placed into the target FPR 


5. FPSCRra is undefined 

FPSCR- is set to 1 

7. FPSCRrppr is set to indicate the class and sign of 
the result (x Infinity or x Normal Number) 


D 


4.4.4 Underflow Exception 
4.4.4.1 Definition 


Underflow Exception is defined separately for the 
enabled and disabled states: 


m Enabled: 
Underflow occurs when the intermediate result is 
“Tiny”. 

m Disabled: 
Underflow occurs when the intermediate result is 
“Tiny” and there is “Loss of Accuracy”. 


A “Tiny” result is detected before rounding, when a 
nonzero intermediate result computed as though both 
the precision and the exponent range were 
unbounded would be less in magnitude than the 
smallest normalized number. 


If the intermediate result is “Tiny” and Underflow 
Exception is disabled (FPSCRyg=0) then the interme- 
diate result is denormalized (see Section 4.3.4, “Nor- 
malization and Denormalization” on page 87) and 
rounded (see Section 4.3.6, “Rounding” on page 88) 
before being placed into the target FPR. 


“Loss of Accuracy” is detected when the delivered 
result value differs from what would have been com- 
puted were both the precision and the exponent range 
unbounded. 


4.4.4.2 Action 


The action to be taken depends on the setting of the 
Underflow Exception Enable bit of the FPSCR. 


When Underflow Exception is enabled (FPSCRyg= 1) 
and exponent underflow occurs, the following actions 
are taken: 


1. Underflow Exception is set 
FPSCRux € 1 

2. For double-precision arithmetic instructions, the 
exponent of the normalized intermediate result is 
adjusted by adding 1536 

3. For single-precision arithmetic instructions and 
the Floating Round to Single-Precision instruc- 
tion, the exponent of the normalized intermediate 
result is adjusted by adding 192 

4. The adjusted rounded result is placed into the 
target FPR 

5. FPSCRrppF is set to indicate the class and sign of 
the result (+ Normalized Number) 
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Programming Note 


The FR and FI bits are provided to allow the 
system floating-point enabled exception error 
handler, when invoked because of an Underflow 
Exception, to simulate a “trap disabled” environ- 
ment. Thatis, the FR and FI bits allow the system 
floating-point enabled exception error handler to 
unround the result, thus allowing the result to be 
denormalized. 


When Underflow Exception is disabled (FPSCRyg=0) 
and underflow occurs, the following actions are taken: 


1. Underflow Exception is set 
FPSCRux + 1 
2. The rounded result is placed into the target FPR 
3. FPSCRrpprF is set to indicate the class and sign of 
the result (+ Normalized Number, + Denormalized 
Number, or + Zero) 


4.4.5 Inexact Exception 
4.4.5.1 Definition 


An Inexact Exception occurs when one of two condi- 
tions occur during rounding: 


1. The rounded result differs from the intermediate 
result assuming both the precision and the expo- 
nent range of the intermediate result to be 
unbounded. In this case the result is said to be 
inexact. (If the rounding causes an enabled Over- 
flow Exception or an enabled Underflow Excep- 
tion, an Inexact Exception also occurs only if the 
significands of the rounded result and the inter- 
mediate result differ.) 

2. The rounded result overflows and Overflow 
Exception is disabled. 


4.4.5.2 Action 


The action to be taken does not depend on the setting 
of the Inexact Exception Enable bit of the FPSCR. 


When Inexact Exception occurs, the following actions 
are taken: 


1. Inexact Exception is set 
FPSCRxx € 1 
2. The rounded or overflowed result is placed into 
the target FPR 
3. FPSCRrppF is set to indicate the class and sign of 
the result 


Programming Note 


In some implementations, Inexact 


enabling 
Exceptions may degrade performance more than 
does enabling other types of floating-point excep- 
tion. 


4.5 Floating-Point Execution 
Models 


All implementations of this architecture must provide 
the equivalent of the following execution models to 
ensure that identical results are obtained. 


Special rules are provided in the definition of the 
computational instructions for the infinities, denormal- 
ized numbers and NaNs. The material in the 
remainder of this section applies to instructions that 
have numeric operands and a numeric result (i.e., 
operands and result that are not infinities or NaNs), 
and that cause no exceptions. See Section 4.3.2, 
"Value Representation" on page 85 and Section 4.4, 
“Floating-Point Exceptions" on page 89 for the cases 
not covered here. 


Although the double format specifies an 11-bit expo- 
nent, exponent arithmetic makes use of two additional 
bits to avoid potential transient overflow conditions. 
One extra bit is required when denormalized double- 
precision numbers are prenormalized. The second bit 
is required to permit the computation of the adjusted 
exponent value in the following cases when the corre- 
sponding exception enable bit is 1: 


m Underflow during multiplication using a denormal- 
ized operand. 

m Overflow during division using a denormalized 
divisor. 


The IEEE standard includes 32-bit and 64-bit arith- 
metic. The standard requires that single-precision 
arithmetic be provided for single-precision operands. 
The standard permits double-precision floating-point 
operations to have either (or both) single-precision or 
double-precision operands, but states that single- 
precision floating-point operations should not accept 
double-precision operands. The PowerPC AS Archi- 
tecture follows these guidelines; double-precision 
arithmetic instructions can have operands of either or 
both precisions, while single-precision arithmetic 
instructions require all operands to be single- 
precision. Double-precision arithmetic instructions 
and fcfid produce double-precision values, while 
single-precision arithmetic instructions produce 
single-precision values. 


For arithmetic instructions, conversions from double- 
precision to single-precision must be done explicitly 
by software, while conversions from single-precision 
to double-precision are done implicitly. 
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4.5.1 Execution Model for IEEE 
Operations 


The following description uses 64-bit arithmetic as an 
example. 32-bit arithmetic is similar except that the 
FRACTION is a 23-bit field, and the single-precision 
Guard, Round, and Sticky bits (described in this 
section) are logically adjacent to the 23-bit FRACTION 
field. 


IEEE-conforming significand arithmetic is considered 
to be performed with a floating-point accumulator 
having the following format, where bits 0:55 comprise 
the significand of the intermediate result. 


S|C|L FRACTION G|R|X 
01 52 55 


Figure 34. IEEE 64-bit execution model 
The S bit is the sign bit. 


The C bit is the carry bit, which captures the carry out 
of the significand. 


The L bit is the leading unit bit of the significand, 
which receives the implicit bit from the operand. 


The FRACTION is a 52-bit field that accepts the frac- 
tion of the operand. 


The Guard (G), Round (R), and Sticky (X) bits are 
extensions to the low-order bits of the accumulator. 
The G and R bits are required for postnormalization of 
the result. The G, R, and X bits are required during 
rounding to determine if the intermediate result is 
equally near the two nearest representable values. 
The X bit serves as an extension to the G and R bits 
by representing the logical OR of all bits that may 
appear to the low-order side of the R bit, due either to 
shifting the accumulator right or to other generation 
of low-order result bits. The G and R bits participate 
in the left shifts with zeros being shifted into the R bit. 
Figure 35 shows the significance of the G, R, and X 
bits with respect to the intermediate result (IR), the 
representable number next lower in magnitude (NL), 
and the representable number next higher in magni- 
tude (NH). 


IR midway between NL and NH 


IR closer to NH 


Figure 35. Interpretation of G, R, and X bits 


After normalization, the intermediate result is 
rounded, using the rounding mode specified by 
FPSCRpn. If rounding results in a carry into C, the 
significand is shifted right one position and the expo- 
nent incremented by one. This yields an inexact 
result and possibly also exponent overflow. Fraction 
bits to the left of the bit position used for rounding 
are stored into the FPR and low-order bit positions, if 
any, are set to zero. 


Four user-selectable rounding modes are provided 
through FPSCRay as described in Section 4.3.6, 
“Rounding” on page 88. For rounding, the conceptual 
Guard, Round, and Sticky bits are defined in terms of 
accumulator bits. Figure 36 shows the positions of 
the Guard, Round, and Sticky bits for double-precision 
and single-precision floating-point numbers in the 
IEEE execution model. 


[Guard [Round [Sig — —  — 


Double |G bit R bit 
Single 24 25 


X bit 
OR of 26:52, G, R, X 


Figure 36. Location of the Guard, Round, and Sticky 
bits in the IEEE execution model 


Rounding can be treated as though the significand 
were shifted right, if required, until the least signif- 
icant bit to be retained is in the low-order bit position 
of the FRACTION. If any of the Guard, Round, or 
Sticky bits is nonzero, then the result is inexact. 


Z1 and Z2, as defined on page 89, can be used to 
approximate the result in the target format when one 
of the following rules is used. 


m Round to Nearest 


Guard bit = 0 
The result is truncated. (Result exact (GRX = 
000) or closest to next lower value in magni- 
tude (GRX = 001, 010, or 011)) 


Guard bit = 1 
Depends on Round and Sticky bits: 


Case a 
If the Round or Sticky bit is 1 (inclusive), 
the result is incremented. (Result closest 
to next higher value in magnitude (GRX 
= 101, 110, or 111)) 


Case b 

If the Round and Sticky bits are O (result 
midway between closest representable 
values), then if the low-order bit of the 
result is 1 the result is incremented. 
Otherwise (the low-order bit of the result 
is 0) the result is truncated (this is the 
case of a tie rounded to even). 


m Round toward Zero 
Choose the smaller in magnitude of Z1 or Z2. If 
the Guard, Round, or Sticky bit is nonzero, the 
result is inexact. 
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m Round toward + Infinity 
Choose Z1. 


m Round toward - Infinity 
Choose Z2. 


Where the result is to have fewer than 53 bits of pre- 
cision because the instruction is a Floating Round to 
Single-Precision or single-precision arithmetic instruc- 
tion, the intermediate result is either normalized or 
placed in correct denormalized form before being 
rounded. 


4.5.2 Execution Model for 
Multiply-Add Type Instructions 


The PowerPC AS Architecture provides a special form 
of instruction that performs up to three operations in 
one instruction (a multiplication, an addition, and a 
negation). With this added capability comes the 
special ability to produce a more exact intermediate 
result as input to the rounder. 32-bit arithmetic is 
similar except that the FRACTION field is smaller. 


Multiply-add significand arithmetic is considered to be 
performed with a floating-point accumulator having 
the following format, where bits 0:106 comprise the 
significand of the intermediate result. 


S|C|L FRACTION X' 
0 1 105 106 


Figure 37. Multiply-add 64-bit execution model 


The first part of the operation is a multiplication. The 
multiplication has two 53-bit significands as inputs, 
which are assumed to be prenormalized, and 
produces a result conforming to the above model. If 
there is a carry out of the significand (into the C bit), 
then the significand is shifted right one position, 
shifting the L bit (leading unit bit) into the most signif- 


icant bit of the FRACTION and shifting the C bit (carry 
out) into the L bit. All 106 bits (L bit, the FRACTION) 
of the product take part in the add operation. If the 
exponents of the two inputs to the adder are not 
equal, the significand of the operand with the smaller 
exponent is aligned (shifted) to the right by an amount 
that is added to that exponent to make it equal to the 
other input's exponent. Zeros are shifted into the left 
of the significand as it is aligned and bits shifted out 
of bit 105 of the significand are ORed into the X' bit. 
The add operation also produces a result conforming 
to the above model with the X' bit taking part in the 
add operation. 


The result of the addition is then normalized, with all 
bits of the addition result, except the X' bit, partic- 
ipating in the shift. The normalized result serves as 
the intermediate result that is input to the rounder. 


For rounding, the conceptual Guard, Round, and 
Sticky bits are defined in terms of accumulator bits. 
Figure 38 shows the positions of the Guard, Round, 
and Sticky bits for double-precision and single- 
precision floating-point numbers in the multiply-add 
execution model. 


Format [Guard [rouna [oy — — — 


Double |53 54 OR of 55:105, X' 
Single |24 25 OR of 26:105, X' 


Figure 38. Location of the Guard, Round, and Sticky 
bits in the multiply-add execution model 


The rules for rounding the intermediate result are the 
same as those given in Section 4.5.1, "Execution 
Model for IEEE Operations" on page 95. 


If the instruction is Floating Negative Multiply-Add or 
Floating Negative Multiply-Subtract, the final result is 
negated. 
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4.6 Floating-Point Processor Instructions 


2 — —— Architecture Note tA ————— —— — — — — — — — ——4 


The rules followed in assigning new primary and 
extended opcodes, for instructions that are not in 
the POWER Architecture, are the following. 


1. 


A new primary opcode, 59, has been added. It 
is used for the single-precision arithmetic 
instructions. 


The single-precision instructions for which there 
is a corresponding double-precision instruction 
have the same format and extended opcode as 
that double-precision instruction. 


In assigning new extended opcodes for primary 
opcode 63, the following regularities, present in 
the POWER Architecture, have been maintained. 
In addition, all new X-form instructions in 
primary opcode 63 have bits 21:22 = 0b11, 
which distinguishes them from the X-form 
instructions present in POWER Architecture. 


m Bit 26 = 1 iff the instruction is A-form. 


m Bits 26:29 = 0b0000 iff the instruction is a 
comparison or merfs (i.e., iff the instruction 
sets an explicitly-designated CR field). 


m Bits 26:28 = 0b001 iff the instruction explic- 
itly refers to or sets the FPSCR (i.e., is a 
Floating-Point Status and Control Register 
instruction) and is not mcrfs. 


m Bits 26:30 = 0b01000 iff the instruction is a 
Move Register instruction, or any other 
instruction that does not refer to or set the 
FPSCR. 


4. 


In assigning extended opcodes for primary 
opcode 59, the following regularities have been 
maintained. They are based on those rules for 
primary opcode 63 that apply to the instructions 
having primary opcode 59. In particular, 
primary opcode 59 has no Floating-Point Status 
and Control Register instructions, so the corre- 
sponding rule does not apply. 


m |f there is a corresponding instruction with 
primary opcode 63, its extended opcode is 


used. 
m Bit 26 = 1 iff the instruction is A-form. 
m Bits 26:30 = 0b01000 iff the instruction is a 


Move Register instruction, or any other 
instruction that does not refer to or set the 
FPSCR. 
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4.6.1 Floating-Point Storage Access Instructions 


The Storage Access instructions compute the effective 
address (EA) of the storage to be accessed as 
described in Section 1.12.2, “Effective Address 
Calculation” on page 15. 


Programming Note 


The la extended mnemonic permits computing an 
effective address as a Load or Store instruction 
would, but loads the address itself into a GPR 
rather than loading the value that is in storage at 
that address. This extended mnemonic is 
described in Section  B.9, “Miscellaneous 


Mnemonics” on page 153. 


4.6.2 Floating-Point Load Instructions 


There are two basic forms of load instruction: single- 
precision and double-precision. Because the FPRs 
support only floating-point double format, single- 
precision Load Floating-Point instructions convert 
single-precision data to double format prior to loading 
the operand into the target FPR. The conversion and 
loading steps are as follows. 


Let WORDo.3; be the floating-point single-precision 
operand accessed from storage. 


Normalized Operand 
if WORD}. > 0 and WORD, < 255 then 
FRT 9.1 € WORD,.; 
FRT, € “WORD, 
FRT¿ € “WORD, 
FRT, € “WORD, 
FRT5.¢3 € WORD», | 790 


Denormalized Operand 
if WORD;. = 0 and WORDg,, + 0 then 
sign + WORD), 
exp € — 126 
fraCo:s2 + 0b0 | WORDs:3; | 290 
normalize the operand 
do while fraco = 0 
frac € frac; .52 | 0b0 
exp € exp - 1 
FRTo € sign 
FRT4-44 + exp + 1023 
FRT4283 € frac» 


4.6.1.1 Storage Access Exceptions 


Storage accesses will cause the system data storage 
error handler to be invoked if the program is not 
allowed to modify the target storage (Store only), or if 
the program attempts to access storage that is una- 
vailable. 


Zero / Infinity / NaN 
if WORD}. = 255 or WORD,.3; = Othen 
FRTo4 € WORD. 
FRT> € WORD, 
FRT3 € WORD, 
FRT, € WORD, 
FRT5.93 € WORD2:1 | 20 


Engineering Note 


The above description of the conversion steps is a 
model only. The actual implementation may vary 
from this but must produce results equivalent to 
what this model would produce. 


For double-precision Load Floating-Point instructions 
no conversion is required, as the data from storage 
are copied directly into the FPR. 


Many of the Load Floating-Point instructions have an 
“update” form, in which register RA is updated with 
the effective address. For these forms, if RAz0, the 
effective address is placed into register RA and the 
storage element (word or doubleword) addressed by 
EA is loaded into FRT. 


Note: Recall that RA and RB denote General Purpose 
Registers, while FRT denotes a Floating-Point Reg- 
ister. 
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Load Floating-Point Single D-form 


Ifs FRT,D(RA) 


e 0 
else € (RA) 
EA € b + EXTS(D o 

FRT € DOUBLE (MEM(EA, 4)) 


Let the effective address (EA) be the sum (RA|0)+D 
The word in storage addressed by EA is interpreted 
as a floating-point single-precision operand. This 
word is converted to floating-point double format (see 
page 98) and placed into register FRT. 


Special Registers Altered: 
None 


Load Floating-Point Single with Update 
D-form 


Ifsu FRT,D(RA) 


EA € (RA) + EXTS(D) 
FRI € DOUBLE (MEM(EA, 4) ) 
RA € EA 


Let the effective address (EA) be the sum (RA)+D. 
The word in storage addressed by EA is interpreted 
as a floating-point single-precision operand. This 
word is converted to floating-point double format (see 
page 98) and placed into register FRT. 

EA is placed into register RA. 


If RA=0, the instruction form is invalid. 


Special Registers Altered: 
None 


Load Floating-Point Single Indexed 
X-form 


Ifsx FRT,RA,RB 


31 FRT RA RB 535 / 
0 6 11 16 21 31 


ee 0 
else € (RA) 
EA € b + (RB) 

FRT € DOUBLE (MEM(EA, 4)) 


Let the effective address (EA) be the sum 
(RAJO)+(RB). 

The word in storage addressed by EA is interpreted 
as a floating-point single-precision operand. This 
word is converted to floating-point double format (see 
page 98) and placed into register FRT. 


Special Registers Altered: 
None 


Load Floating-Point Single with Update 
Indexed X-form 


Ifsux FRT,RA,RB 


31 FRT RA RB 567 / 
0 6 11 16 21 31 


EA € (RA) + (RB) 

FRT € DOUBLE (MEM(EA, 4)) 

RA € EA 

Let the effective address (EA) be the sum (RA)+(RB). 
The word in storage addressed by EA is interpreted 
as a floating-point single-precision operand. This 
word is converted to floating-point double format (see 
page 98) and placed into register FRT. 

EA is placed into register RA. 

If RA=0, the instruction form is invalid. 


Special Registers Altered: 
None 
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Load Floating-Point Double D-form 


lfd FRT,D(RA) 


50 FRT RA D 


if RA = 0 then b ¢ 
else b € (RA) 

EA € b + EXTS(D) 

FRT € MEM(EA, 8) 

Let the effective address (EA) be the sum (RA|0)+D. 


The doubleword in storage addressed by EA is placed 
into register FRT. 


Special Registers Altered: 
None 


Load Floating-Point Double with Update 
D-form 


lfdu FRT,D(RA) 


51 FRT RA D 
0 6 11 16 31 


EA € (RA) + EXTS(D) 

FRT € MEM(EA, 8) 

RA € EA 

Let the effective address (EA) be the sum (RA)+D. 


The doubleword in storage addressed by EA is placed 
into register FRT. 


EA is placed into register RA. 
If RA=0, the instruction form is invalid. 


Special Registers Altered: 
None 
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Load Floating-Point Double Indexed 
X-form 


Ifdx FRT,RA,RB 


31 FRT RA RB 599 / 
0 6 11 16 21 31 


if RA = 0 then b €0 
else b € (RA) 
EA € b + (RB) 

FRT € MEM(EA, 8) 


Let the effective address (EA) be the sum 
(RAJO)+(RB). 


The doubleword in storage addressed by EA is placed 
into register FRT. 


Special Registers Altered: 
None 


Load Floating-Point Double with Update 
Indexed X-form 


Ifdux FRT,RA,RB 


31 FRT RA RB 631 / 
0 6 11 16 21 31 


EA € (RA) + (RB) 

FRT € MEM(EA, 8) 

RA € EA 

Let the effective address (EA) be the sum (RA)+(RB). 


The doubleword in storage addressed by EA is placed 
into register FRT. 


EA is placed into register RA. 
If RA=0, the instruction form is invalid. 


Special Registers Altered: 
None 
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4.6.3 Floating-Point Store Instructions 


There are three basic forms of store instruction: 
single-precision, double-precision, and integer. The 
integer form is provided by the Store Floating-Point as 
Integer Word instruction, described on page 104. 
Because the FPRs support only floating-point double 
format for floating-point data, single-precision Store 
Floating-Point instructions convert double-precision 
data to single format prior to storing the operand into 
storage. The conversion steps are as follows. 


Let WORD .3; be the word in storage written to. 


No Denormalization Required (includes Zero / Infinity 
/ NaN) 
if FRS4.1 > 896 or FRS4.63 = 0 then 

WORDo:4 € FRS94 

WORD2:31 € FRS5:34 


Denormalization Required 
if 874 < FRS,44 < 896 then 
sign € FRSy 
exp € FRS,.,, — 1023 
frac € 0b1 | FRS45.5 
denormalize operand 
do while exp « — 126 
frac € 0b0 | fracg.go 
exp € exp + 1 
WORD); € sign 
WORD,.g € 0x00 
WORDg;3, € fracy.o3 
else WORD € undefined 


Notice that if the value to be stored by a single- 
precision Store Floating-Point instruction is larger in 
magnitude than the maximum number representable 
in single format, the first case above (No Denormal- 
ization Required) applies. The result stored in WORD 
is then a well-defined value, but is not numerically 
equal to the value in the source register (i.e., the 
result of a single-precision Load Floating-Point from 
WORD will not compare equal to the contents of the 
original source register). 


Engineering Note 


The above description of the conversion steps is a 
model only. The actual implementation may vary 
from this but must produce results equivalent to 
what this model would produce. 


For double-precision Store Floating-Point instructions 
and for the Store Floating-Point as Integer Word 
instruction no conversion is required, as the data 
from the FPR are copied directly into storage. 


Many of the Store Floating-Point instructions have an 
"update" form, in which register RA is updated with 
the effective address. For these forms, if RA#0, the 
effective address is placed into register RA. 


Note: Recall that RA and RB denote General Purpose 
Registers, while FRS denotes a Floating-Point Reg- 
ister. 


Chapter 4. Floating-Point Processor 101 


Store Floating-Point Single D-form 


stfs FRS,D(RA) 


52 FRS RA D 


if RA = 0 then b 
else b 
EA € b + EXTS(D) 
MEM (EA, 4) € SINGLE ((FRS)) 


«0 
+í 


RA) 

Let the effective address (EA) be the sum (RA|0)+D. 
The contents of register FRS are converted to single 
format (see page 101) and stored into the word in 


storage addressed by EA. 


Special Registers Altered: 
None 


Store Floating-Point Single with Update 
D-form 


stfsu FRS,D(RA) 


53 FRS RA D 
0 6 11 16 31 


EA € (RA) + EXTS(D) 

MEM (EA, 4) € SINGLE ((FRS)) 

RA € EA 

Let the effective address (EA) be the sum (RA)+D. 
The contents of register FRS are converted to single 
format (see page 101) and stored into the word in 
storage addressed by EA. 

EA is placed into register RA. 


If RA=0, the instruction form is invalid. 


Special Registers Altered: 
None 
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Store Floating-Point Single Indexed 
X-form 


stfsx FRS,RA,RB 


31 FRS RA RB 663 / 
0 6 11 16 21 31 


if RA = 0 then b €0 

else b € (RA) 
EA € b + (RB) 

MEM (EA, 4) € SINGLE ((FRS)) 


Let the effective address (EA) be the sum 
(RAJO)+(RB). 


The contents of register FRS are converted to single 
format (see page 101) and stored into the word in 
storage addressed by EA. 


Special Registers Altered: 
None 


Store Floating-Point Single with Update 
Indexed X-form 


stfsux FRS,RA,RB 


31 FRS RA RB 695 / 
0 6 11 16 21 31 


EA € (RA) + (RB) 

MEM (EA, 4) € SINGLE ((FRS)) 

RA € EA 

Let the effective address (EA) be the sum (RA)+(RB). 
The contents of register FRS are converted to single 
format (see page 101) and stored into the word in 
storage addressed by EA. 

EA is placed into register RA. 


If RA=0, the instruction form is invalid. 


Special Registers Altered: 
None 
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Store Floating-Point Double D-form 


stfd FRS,D(RA) 


FRS 
31 


REL d Pi 0 

else € (RA) 

EA € b + EXTS(D 3 

MEM (EA, 8) € (FRS) 

Let the effective address (EA) be the sum (RA|0)+D 


The contents of register FRS are stored into the 
doubleword in storage addressed by EA. 


Special Registers Altered: 
None 


Store Floating-Point Double with Update 
D-form 


stfdu FRS,D(RA) 


FRS 
31 


EA € (RA) + EXTS(D) 

MEM (EA, 8) € (FRS) 

RA € EA 

Let the effective address (EA) be the sum (RA)+D. 


The contents of register FRS are stored into the 
doubleword in storage addressed by EA. 


EA is placed into register RA. 
If RA=0, the instruction form is invalid. 


Special Registers Altered: 
None 


Store Floating-Point Double Indexed 
X-form 


stfdx FRS,RA,RB 


31 FRS RA RB 727 / 
0 6 11 16 21 31 


MD QE 0 
else « (RA) 
EA € b + (RB) 

MEM(EA, 8) « (FRS) 


Let the effective 
(RA|0) - (RB). 


address (EA) be the sum 
The contents of register FRS are stored into the 
doubleword in storage addressed by EA. 


Special Registers Altered: 
None 


Store Floating-Point Double with Update 
Indexed X-form 


stfdux FRS,RA,RB 


31 FRS RA RB 759 / 
0 6 11 16 21 31 


EA € (RA) + (RB) 

MEM (EA, 8) € (FRS) 

RA € EA 

Let the effective address (EA) be the sum (RA)+(RB). 


The contents of register FRS are stored into the 
doubleword in storage addressed by EA. 


EA is placed into register RA. 
If RA=0, the instruction form is invalid. 


Special Registers Altered: 
None 
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Store Floating-Point as Integer Word 
Indexed X-form 


stfiwx FRS,RA,RB 


31 FRS RA RB 983 / 
0 6 11 16 21 31 


if RA = 0 then b €0 
else b € (RA) 
EA € b + (RB) 

MEM(EA, 4) € (FRS) 30:63 


Let the effective address (EA) be the sum 
(RA[0) - (RB). 


The contents of the low-order 32 bits of register FRS 
are stored, without conversion, into the word in 
storage addressed by EA. 


If the contents of register FRS were produced, either 
directly or indirectly, by a Load Floating-Point Single 
instruction, a single-precision Arithmetic instruction, 
or frsp, then the value stored is undefined. (The con- 
tents of register FRS are produced directly by such an 
instruction if FRS is the target register for the instruc- 
tion. The contents of register FRS are produced indi- 
rectly by such an instruction if FRS is the final target 
register of a sequence of one or more Floating-Point 
Move instructions, with the input to the sequence 
having been produced directly by such an instruction.) 


Special Registers Altered: 
None 


Architecture Note 


Allowing the value stored to be undefined if the 
input to stfiwx was produced by a single- 
precision-producing instruction (i.e, a Load 
Floating-Point Single instruction, a single- 
precision arithmetic instruction, or frsp) seems 
gratuitous at the architectural level. The back- 
ground and reasons for allowing it are as follows. 


m The implementors agreed to support stfiwx 
partly because they understood it to be easy 
to implement. 


= In some implementations (e.g., those that 
keep single-precision numbers in registers in 
a non-architected format), storing the archi- 
tected low-order 32 bits of a register that was 
set by a single-precision-producing instruction 
may be harder (and slower, and more trouble 
to verify) than simply storing whatever 
happens to be in the low-order 32 bits of the 
register. 


= Software can think of no use for storing the 
low-order 32 bits of the result of a single- 
precision producing instruction. 
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4.6.4 Floating-Point Move Instructions 


These instructions copy data from one floating-point 
register to another, altering the sign bit (bit 0) as 
described below for fneg, fabs, and fnabs. These 
instructions treat NaNs just like any other kind of 


Floating Move Register X-form 


fmr FRT,FRB 
fmr. FRT,FRB 


63 FRT Il FRB 72 Rc 
0 6 11 16 21 31 


The contents of register FRB are placed into register 
FRT. 


(Rc=0) 
(Re=1) 


Special Registers Altered: 
CR1 (if Rc=1) 


Floating Absolute Value X-form 


fabs FRT,FRB (Rc=0) 
fabs. FRT,FRB (Rc=1) 

63 FRT | /// | FRB 264 Re 
0 6 11 16 21 31 


The contents of register FRB with bit 0 set to zero are 
placed into register FRT. 


Special Registers Altered: 
CR1 (if Rc=1) 


value (e.g., the sign bit of a NaN may be altered by 
fneg, fabs, and fnabs). These instructions do not alter 
the FPSCR. 


Floating Negate X-form 


fneg FRT,FRB 
fneg. FRT,FRB 


63 FRT Il FRB 40 Rc 
0 6 11 16 21 31 


The contents of register FRB with bit O inverted are 
placed into register FRT. 


Special Registers Altered: 
CR1 (if Rc=1) 


Floating Negative Absolute Value 
X-form 


FRT,FRB 
FRT,FRB 


fnabs 
fnabs. 


63 FRT Il FRB 136 Rc 
0 6 11 16 21 31 


The contents of register FRB with bit O set to one are 
placed into register FRT. 


Special Registers Altered: 
CR1 (if Re=1) 
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4.6.5 Floating-Point Arithmetic Instructions 


4.6.5.1 Floating-Point Elementary Arithmetic Instructions 


Floating Add [ Single] A-form 


fadd FRT,FRA,FRB 
fadd. FRT,FRA,FRB 


[POWER mnemonics: fa, fa.] 


63 FRT FRA FRB Hl 21 |Rc 
0 6 11 16 21 26 31 


fadds FRT,FRA,FRB (Re=0) 
fadds. FRT,FRA,FRB (Rc=1) 

59 FRT | FRA | FRB | /// | 21 [Re 
0 6 11 16 21 26 31 


The floating-point operand in register FRA is added to 
the floating-point operand in register FRB. 


If the most significant bit of the resultant significand is 
not 1, the result is normalized. The result is rounded 
to the target precision under control of the Floating- 
Point Rounding Control field RN of the FPSCR and 
placed into register FRT. 


Floating-point addition is based on exponent compar- 
ison and addition of the two significands. The expo- 
nents of the two operands are compared, and the 
significand accompanying the smaller exponent is 
shifted right, with its exponent increased by one for 
each bit shifted, until the two exponents are equal. 
The two significands are then added or subtracted as 
appropriate, depending on the signs of the operands, 
to form an intermediate sum. All 53 bits of the 
significand as well as all three guard bits (G, R, and 
X) enter into the computation. 


If a carry occurs, the sum's significand is shifted right 
one bit position and the exponent is increased by one. 


FPSCRrprr is set to the class and sign of the result, 


except for Invalid Operation Exceptions when 
FPSCRyg= 1 . 
Special Registers Altered: 

FPRF FR FI 

FX OX UX XX 

VXSNAN VXISI 

CR1 (if Rc=1) 


Floating Subtract [ Single] A-form 


fsub FRT,FRA,FRB 
fsub. FRT,FRA,FRB 


[POWER mnemonics: fs, fs.] 


63 FRT FRA FRB Il 20 |Rc 
0 6 11 16 21 26 31 


fsubs FRT,FRA,FRB (Re=0) 
fsubs. FRT,FRA,FRB (Rc=1) 

59 FRT | FRA | FRB | /// | 20 [Re 
0 6 11 16 21 26 31 


The floating-point operand in register FRB is sub- 
tracted from the floating-point operand in register 
FRA. 


If the most significant bit of the resultant significand is 
not 1, the result is normalized. The result is rounded 
to the target precision under control of the Floating- 
Point Rounding Control field RN of the FPSCR and 
placed into register FRT. 


The execution of the Floating Subtract instruction is 
identical to that of Floating Add, except that the con- 
tents of FRB participate in the operation with the sign 
bit (bit 0) inverted. 


FPSCRrprr is set to the class and sign of the result, 


except for Invalid Operation Exceptions when 
FPSCRyg- 1. 
Special Registers Altered: 

FPRF FR FI 

FX OX UX XX 

VXSNAN VXISI 

CR1 (if Rc=1) 
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Floating Multiply [Single] A-form 


fmul FRT,FRA,FRC 
fmul. FRT,FRA,FRC 


[POWER mnemonics: fm, fm.] 


63 FRT FRA Il FRC 25 |Rc 
0 6 11 16 21 26 31 


fmuls FRT,FRA,FRC 
fmuls. FRT,FRA,FRC 


59 FRT FRA Il FRC 25 |Rc 
0 6 11 16 21 26 31 


The floating-point operand in register FRA is multi- 
plied by the floating-point operand in register FRC. 


If the most significant bit of the resultant significand is 
not 1, the result is normalized. The result is rounded 
to the target precision under control of the Floating- 
Point Rounding Control field RN of the FPSCR and 
placed into register FRT. 


Floating-point multiplication is based on exponent 
addition and multiplication of the significands. 


FPSCReppF is set to the class and sign of the result, 


except for Invalid Operation Exceptions when 
FPSCRyg= 1. 
Special Registers Altered: 

FPRF FR FI 

FX OX UX XX 

VXSNAN VXIMZ 

CR1 (if Rc=1) 


Floating Divide [ Single] A-form 


fdiv FRT,FRA,FRB 
fdiv. FRT,FRA,FRB 


[POWER mnemonics: fd, fd.] 


63 FRT FRA FRB Il 18 |Rc 
0 6 11 16 21 26 31 


fdivs FRT,FRA,FRB 
fdivs. FRT,FRA,FRB 


59 FRT FRA FRB Il 18 |Rc 
0 6 11 16 21 26 31 


The floating-point operand in register FRA is divided 
by the floating-point operand in register FRB. The 
remainder is not supplied as a result. 


If the most significant bit of the resultant significand is 
not 1, the result is normalized. The result is rounded 
to the target precision under control of the Floating- 
Point Rounding Control field RN of the FPSCR and 
placed into register FRT. 


Floating-point division is based on exponent sub- 
traction and division of the significands. 


FPSCRrppF is set to the class and sign of the result, 


except for Invalid Operation Exceptions when 
FPSCRye=1 and Zero Divide Exceptions when 
FPSCRzg-1. 


Special Registers Altered: 
FPRF FR FI 
FX OX UX ZX XX 
VXSNAN VXIDI VXZDZ 
CR1 (if Rc=1) 
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4.6.5.2 Floating-Point Multiply-Add Instructions 


These instructions combine a multiply and an add 
operation without an intermediate rounding operation. 
The fraction part of the intermediate product is 106 
bits wide (L bit, FRACTION), and all 106 bits take part 
in the add/subtract portion of the instruction. 


Status bits are set as follows. 


m Overflow, Underflow, and Inexact Exception bits, 
the FR and FI bits, and the FPRF field are set 


Floating Multiply-Add [Single] A-form 


fmadd FRT,FRA,FRC,FRB (Rc=0) 
fmadd. FRT,FRA,FRC,FRB (Rc=1) 
[POWER mnemonics: fma, fma.] 

63 FRT FRA FRB FRC 29 |Rc 
0 6 11 16 21 26 31 
fmadds FRT,FRA,FRC,FRB (Rc=0) 
fmadds. FRT,FRA,FRC,FRB (Rc=1) 


59 FRT FRA FRB FRC 29 |Rc 
0 6 11 16 21 26 31 


The operation 
FRT € [(FRA)x(FRC)] + (FRB) 
is performed. 


The floating-point operand in register FRA is multi- 
plied by the floating-point operand in register FRC. 
The floating-point operand in register FRB is added to 
this intermediate result. 


If the most significant bit of the resultant significand is 
not 1, the result is normalized. The result is rounded 
to the target precision under control of the Floating- 
Point Rounding Control field RN of the FPSCR and 
placed into register FRT. 


FPSCRrprç is set to the class and sign of the result, 


except for Invalid Operation Exceptions when 
FPSCRyg= 1. 
Special Registers Altered: 

FPRF FR FI 

FX OX UX XX 

VXSNAN VXISI VXIMZ 

CR1 (if Rc=1) 


based on the final result of the operation, and not 
on the result of the multiplication. 


m Invalid Operation Exception bits are set as if the 
multiplication and the addition were performed 
using two separate instructions (fmul[s], followed 
by fadd[s] or fsub[s]). That is, multiplication of 
infinity by O or of anything by an SNaN, and/or 
addition of an SNaN, cause the corresponding 
exception bits to be set. 


Floating Multiply-Subtract [ Single] 
A-form 


fmsub 
fmsub. 


FRT,FRA,FRC,FRB (Rc=0) 
FRT,FRA,FRC,FRB (Rc=1) 


[POWER mnemonics: fms, fms.] 


63 FRT FRA FRB FRC 28 |Rc 
0 6 11 16 21 26 31 


FRT,FRA,FRC,FRB 
FRT,FRA,FRC,FRB 


fmsubs 
fmsubs. 


59 FRT FRA FRB FRC 28 |Rc 
0 6 11 16 21 26 31 


The operation 
FRT € [(FRA)x(FRC)] — (FRB) 
is performed. 


The floating-point operand in register FRA is multi- 
plied by the floating-point operand in register FRC. 
The floating-point operand in register FRB is sub- 
tracted from this intermediate result. 


If the most significant bit of the resultant significand is 
not 1, the result is normalized. The result is rounded 
to the target precision under control of the Floating- 
Point Rounding Control field RN of the FPSCR and 
placed into register FRT. 


FPSCRrppF is set to the class and sign of the result, 


except for Invalid Operation Exceptions when 
FPSCRyg= 1. 
Special Registers Altered: 

FPRE FR Fl 

FX OX UX XX 

VXSNAN VXISI VXIMZ 

CR1 (if Rc=1) 
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Floating Negative Multiply-Add [ Single] 
A-form 


fnmadd FRT,FRA,FRC,FRB (Rc=0) 
fnmadd. FRT,FRA,FRC,FRB (Rc=1) 
[POWER mnemonics: fnma, fnma.] 

63 FRT FRA FRB FRC 31 |Rc 
0 6 11 16 21 26 31 
fnmadds FRT,FRA,FRC,FRB (Rc=0) 
fnmadds. FRT,FRA,FRC,FRB (Rc=1) 


59 FRT FRA FRB FRC 31 |Rc 
0 6 11 16 21 26 31 


The operation 
FRT € — ([(FRA)x(FRC)] + (FRB) ) 
is performed. 


The floating-point operand in register FRA is multi- 
plied by the floating-point operand in register FRC. 
The floating-point operand in register FRB is added to 
this intermediate result. 


If the most significant bit of the resultant significand is 
not 1, the result is normalized. The result is rounded 
to the target precision under control of the Floating- 
Point Rounding Control field RN of the FPSCR, then 
negated and placed into register FRT. 


This instruction produces the same result as would be 
obtained by using the Floating Multiply-Add instruc- 
tion and then negating the result, with the following 
exceptions. 


m QNaNs propagate with no effect on their “sign” 
bit. 

m QNaNs that are generated as the result of a disa- 
bled Invalid Operation Exception have a “sign” bit 
of 0. 

m SNaNs that are converted to QNaNs as the result 
of a disabled Invalid Operation Exception retain 
the “sign” bit of the SNaN. 


FPSCRrprç is set to the class and sign of the result, 


except for Invalid Operation Exceptions when 
FPSCRyg= 1. 
Special Registers Altered: 

FPRF FR FI 

FX OX UX XX 

VXSNAN VXISI VXIMZ 

CR1 (if Rc=1) 


Floating Negative Multiply-Subtract 
[Single] A-form 


fnmsub FRT,FRA,FRC,FRB (Rc=0) 
fnmsub. FRT,FRA,FRC,FRB (Rc=1) 
[POWER mnemonics: fnms, fnms.] 

63 FRT FRA FRB FRC 30 |Rc 
0 6 11 16 21 26 31 
fnmsubs FRT,FRA,FRC,FRB (Rc=0) 
fnmsubs. FRT,FRA,FRC,FRB (Rc=1) 


59 FRT FRA FRB FRC 30 |Rc 
0 6 11 16 21 26 31 


The operation 
FRT € — ([(FRA)x(FRC)] — (FRB) ) 
is performed. 


The floating-point operand in register FRA is multi- 
plied by the floating-point operand in register FRC. 
The floating-point operand in register FRB is sub- 
tracted from this intermediate result. 


If the most significant bit of the resultant significand is 
not 1, the result is normalized. The result is rounded 
to the target precision under control of the Floating- 
Point Rounding Control field RN of the FPSCR, then 
negated and placed into register FRT. 


This instruction produces the same result as would be 
obtained by using the Floating Multiply-Subtract 
instruction and then negating the result, with the fol- 
lowing exceptions. 


= QNaNs propagate with no effect on their “sign” 
bit. 

m QNaNs that are generated as the result of a disa- 
bled Invalid Operation Exception have a "sign" bit 
of 0. 

m SNaNs that are converted to QNaNs as the result 
of a disabled Invalid Operation Exception retain 
the "sign" bit of the SNaN. 


FPSCRrprç is set to the class and sign of the result, 


except for Invalid Operation Exceptions when 
FPSCRyg- 1 . 
Special Registers Altered: 

FPRF FR FI 

FX OX UX XX 

VXSNAN VXISI VXIMZ 

CR1 (if Rc=1) 
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4.6.6 Floating-Point Rounding and Conversion Instructions 


Programming Note 


Examples of uses of these instructions to perform 
various conversions can be found in Section C.2, 
“Floating-Point Conversions” on page 158. 


Floating Round to Single-Precision 
X-form 


frsp FRT,FRB (Rc=0) 
frsp. FRT,FRB (Rc=1) 


63 FRT Il FRB 12 Rc 
0 6 11 16 21 31 


The floating-point operand in register FRB is rounded 
to single-precision, using the rounding mode specified 
by FPSCRan, and placed into register FRT. 


The rounding is described fully in Section A.1, 
“Floating-Point Round to Single-Precision Model” on 
page 133. 


FPSCRrprç is set to the class and sign of the result, 
except for Invalid Operation Exceptions when 
FPSCRye-1 . 


Special Registers Altered: 
FPRF FR FI 
FX OX UX XX 
VXSNAN 
CR1 (if Rc=1) 
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Floating Convert To Integer Doubleword 
X-form 


fctid FRT,FRB 
fctid. FRT,FRB 


63 FRT Il FRB 814 Rc 
0 6 11 16 21 31 


The floating-point operand in register FRB is con- 
verted to a 64-bit signed fixed-point integer, using the 
rounding mode specified by FPSCRgy, and placed into 
register FRT. 


If the operand in FRB is greater than 28 — 1, then 
FRT is set to Ox7FFF FFFF FFFF FFFF. If the 
operand in FRB is less than — 28, then FRT is set to 
0x8000 0000 0000 0000. 


The conversion is described fully in Section A.2, 
"Floating-Point Convert to Integer Model" on 
page 138. 


Except for enabled Invalid Operation Exceptions, 
FPSCReppr is undefined. FPSCRfp is set if the result 
is incremented when rounded. FPSCR- is set if the 
result is inexact. 


Special Registers Altered: 
FPRF (undefined) FR FI 
FX XX 
VXSNAN VXCVI 
CR1 (if Rc=1) 


Floating Convert To Integer Doubleword 
with round toward Zero X-form 


fctidz 
fctidz. 


FRT,FRB (Rc=0) 
FRT,FRB (Rc=1) 


63 FRT Ill FRB 815 Rc 
0 6 11 16 21 31 


The floating-point operand in register FRB is con- 
verted to a 64-bit signed fixed-point integer, using the 
rounding mode Round toward Zero, and placed into 
register FRT. 


If the operand in FRB is greater than 28 - 1, then 
FRT is set to Ox7FFF FFFF FFFF FFFF. If the 
operand in FRB is less than — 28, then FRT is set to 
0x8000 0000 0000 0000. 


The conversion is described fully in Section A.2, 
“Floating-Point Convert to Integer Model” on 
page 138. 


Except for enabled Invalid Operation Exceptions, 
FPSCRrpgre is undefined. FPSCRra is set if the result 
is incremented when rounded. FPSCRr, is set if the 
result is inexact. 


Special Registers Altered: 
FPRF (undefined) FR FI 
FX XX 
VXSNAN VXCVI 
CR1 (if Rc=1) 
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Floating Convert To Integer Word 
X-form 


fctiw FRT,FRB (Rc=0) 
fctiw. FRT,FRB (Rc=1) 
[POWER2 mnemonics: fcir, fcir.] 

63 FRT Ill FRB 14 Rc 
0 6 11 16 21 31 


The floating-point operand in register FRB is con- 
verted to a 32-bit signed fixed-point integer, using the 
rounding mode specified by FPSCRgy, and placed into 
FRT39.63. The contents of FRT,.3; are undefined. 


If the operand in FRB is greater than 2?! — 1, then bits 
32:63 of FRT are set to Ox7FFF FFFF. If the operand 
in FRB is less than — 2%, then bits 32:63 of FRT are 
set to 0x8000 0000. 


The conversion is described fully in Section A.2, 
"Floating-Point Convert to Integer Model" on 
page 138. 


Except for enabled Invalid Operation Exceptions, 
FPSCRrppF is undefined. FPSCRra is set if the result 
is incremented when rounded. FPSCRçp is set if the 
result is inexact. 


Special Registers Altered: 
FPRF (undefined) FR FI 
FX XX 
VXSNAN VXCVI 
CR1 (if Rc=1) 


Floating Convert To Integer Word with 
round toward Zero X-form 


fctiwz FRT,FRB (Rc=0) 
fctiwz. FRT,FRB (Rc=1) 
[POWER2 mnemonics: fcirz, fcirz.] 

63 FRT Ill FRB 15 Rc 
0 6 11 16 21 31 


The floating-point operand in register FRB is con- 
verted to a 32-bit signed fixed-point integer, using the 
rounding mode Round toward Zero, and placed into 
FRT39.63. The contents of FRT,.3; are undefined. 


If the operand in FRB is greater than 2%! — 1, then bits 
32:63 of FRT are set to Ox7FFF FFFF. If the operand 
in FRB is less than — 2%, then bits 32:63 of FRT are 
set to 0x8000 0000. 


The conversion is described fully in Section A.2, 
"Floating-Point Convert to Integer Model” on 
page 138. 


Except for enabled Invalid Operation Exceptions, 
FPSCRrpgre is undefined. FPSCRra is set if the result 
is incremented when rounded. FPSCRr, is set if the 
result is inexact. 


Special Registers Altered: 
FPRF (undefined) FR FI 
FX XX 
VXSNAN VXCVI 
CR1 (if Rc=1) 
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Floating Convert From Integer 
Doubleword X-form 


fefid FRT,FRB (Rc=0) 
fefid. FRT,FRB (Rc-1) 


63 FRT Ill FRB 846 Rc 
0 6 11 16 21 31 


The 64-bit signed fixed-point operand in register FRB 
is converted to an infinitely precise floating-point 
integer. The result of the conversion is rounded to 
double-precision, using the rounding mode specified 
by FPSCRgy, and placed into register FRT. 


The conversion is described fully in Section A.3, 
“Floating-Point Convert from Integer Model” on 
page 141. 


FPSCRrprç is set to the class and sign of the result. 
FPSCRgg is set if the result is incremented when 
rounded. FPSCRr, is set if the result is inexact. 


Special Registers Altered: 
FPRF FR FI 
FX XX 
CR1 (if Rc=1) 
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4.6.7 Floating-Point Compare Instructions 


The floating-point Compare instructions compare the 
contents of two floating-point registers. Comparison 
ignores the sign of zero (i.e., regards + 0 as equal to 
— 0). The comparison can be ordered or unordered. 


The comparison sets one bit in the designated CR 
field to 1 and the other three to 0. The FPCC is set in 
the same way. 


Floating Compare Unordered X-form 


fcmpu BF,FRA,FRB 


63 BF |//| FRA FRB 0 / 
0 6 9 [t1 16 21 31 


if (FRA) is a NaN or 

(FRB) is a NaN then c € 0b0001 
else if (FRA) < (FRB) then c € 0b1000 
else if (FRA) > (FRB) then c € 0b0100 
else c € 0b0010 
FPCC €c 
CR4xBF:4xBF+3 € C 
if (FRA) is an SNaN or 

(FRB) is an SNaN then 

VXSNAN € 1 


The floating-point operand in register FRA is com- 
pared to the floating-point operand in register FRB. 
The result of the compare is placed into CR field BF 
and the FPCC. 


If either of the operands is a NaN, either quiet or sig- 
naling, then CR field BF and the FPCC are set to 
reflect unordered. If either of the operands is a Sig- 
naling NaN, then VXSNAN is set. 


Special Registers Altered: 
CR field BF 
FPCC 
FX 
VXSNAN 


The CR field and the FPCC are set as follows. 


Bit Name Description 

0 FL (FRA) « (FRB) 

1 FG (FRA) » (FRB) 

2 FE (FRA) = (FRB) 

3 FU (FRA) ? (FRB) (unordered) 


Floating Compare Ordered X-form 


fcmpo BF,FRA,FRB 


63 BF | //| FRA FRB 32 / 
0 6 9 [t1 16 21 31 


if (FRA) is a NaN or 
(FRB) is a NaN then c € 0b0001 
else if (FRA) < (FRB) then c € 0b1000 
else if (FRA) > (FRB) then c € 0b0100 
else c € 0b0010 
FPCC € c 
CR4xBF:4xBF+3 € C 
if (FRA) is an SNaN or 
(FRB) is an SNaN then 
VXSNAN € 1 
if VE = 0 then VXVC € 1 
else if (FRA) is a QNaN or 
(FRB) is a QNaN then VXVC € 1 


The floating-point operand in register FRA is com- 
pared to the floating-point operand in register FRB. 
The result of the compare is placed into CR field BF 
and the FPCC. 


If either of the operands is a NaN, either quiet or sig- 
naling, then CR field BF and the FPCC are set to 
reflect unordered. If either of the operands is a Sig- 
naling NaN, then VXSNAN is set and, if Invalid Opera- 
tion is disabled (VE=0), VXVC is set. If neither 
operand is a Signaling NaN but at least one operand 
is a Quiet NaN, then VXVC is set. 


Special Registers Altered: 
CR field BF 
FPCC 
FX 
VXSNAN VXVC 
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4.6.8 Floating-Point Status and Control Register Instructions 


Every Floating-Point Status and Control Register 
instruction synchronizes the effects of all floating- 
point instructions executed by a given processor. 
Executing a Floating-Point Status and Control Register 
instruction ensures that all floating-point instructions 
previously initiated by the given processor have com- 
pleted before the Floating-Point Status and Control 
Register instruction is initiated, and that no subse- 
quent floating-point instructions are initiated by the 
given processor until the Floating-Point Status and 
Control Register instruction has completed. In partic- 
ular: 


m All exceptions that will be caused by the previ- 
ously initiated instructions are recorded in the 


Move From FPSCR X-form 


mffs FRT (Rc=0) 
mits. FRT (Rc=1) 

63 FRT Hl 111 583 Re 
0 6 11 16 21 31 


The contents of the FPSCR are placed into FRT30.¢3. 
The contents of FRT9.3; are undefined. 


Special Registers Altered: 
CR1 (if Rc=1) 


FPSCR before the Floating-Point Status and 
Control Register instruction is initiated. 


m All invocations of the system floating-point 
enabled exception error handler that will be 
caused by the previously initiated instructions 
have occurred before the Floating-Point Status 
and Control Register instruction is initiated. 


m No subsequent floating-point instruction that 
depends on or alters the settings of any FPSCR 
bits is initiated until the Floating-Point Status and 
Control Register instruction has completed. 


(Floating-point Storage Access instructions are not 
affected.) 


Move to Condition Register from FPSCR 
X-form 


merfs BF,BFA 


63 BF |///BFA|//|  /// 64 / 
0 6 9 |11 14| t6 21 31 


The contents of FPSCR field BFA are copied to Condi- 
tion Register field BF. All exception bits copied are 
set to 0 in the FPSCR. If the FX bit is copied, it is set 
to 0 in the FPSCR. 


Special Registers Altered: 


CR field BF 
FX OX (if BFA=0) 
UX ZX XX VXSNAN (if BFA=1) 
VXISI VXIDI VXZDZ VXIMZ (if BFA-2) 
VXVC (if BFA-3) 
VXSOFT VXSQRT VXCVI (if BFA=5) 
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Move To FPSCR Field Immediate 
X-form 


mtfsfi BF,U (Rc=0) 
mtfsfi. BF,U (Rc=1) 


63 BF |W| /// U|/ 134 Re 
0 6 9 |11 16 2d 21 31 


The value of the U field is placed into FPSCR field BF. 
FPSCRrx is altered only if BF = 0. 
Special Registers Altered: 


FPSCR field BF 
CR1 (if Rc-1) 


Programming Note 


When FPSCRo.3 is specified, bits 0 (FX) and 3 (OX) 
are set to the values of Ug and Us (i.e., even if 
this instruction causes OX to change from Oto 1, 
FX is set from Ug and not by the usual rule that 
FX is set to 1 when an exception bit changes from 
O to 1) Bits 1 and 2 (FEX and VX) are set 
according to the usual rule, given on page 83, and 
not from U,.». 
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Move To FPSCR Fields XFL-form 


mtfsf FLM,FRB (Rc=0) 
mtfsf. FLM,FRB (Rc=1) 

es || FIM I/| FRB 711 Re 
0 617 18 16 21 31 


The contents of bits 32:63 of register FRB are placed 
into the FPSCR under control of the field mask speci- 
fied by FLM. The field mask identifies the 4-bit fields 
affected. Let i be an integer in the range 0-7. If 
FLM;=1 then FPSCR field i (FPSCR bits 4xi:4xi+3) is 
set to the contents of the corresponding field of the 
low-order 32 bits of register FRB. 


FPSCRex is altered only if FLMo = 1. 
Special Registers Altered: 
FPSCR fields selected by mask 
CR1 (if Re=1) 


Programming Note 


Updating fewer than all eight fields of the FPSCR 
may have substantially poorer performance on 
some implementations than updating all the fields. 


Programming Note 


When FPSCRgo.3 is specified, bits O (FX) and 3 (OX) 
are set to the values of (FRB)3. and (FRB)ss (i.e., 
even if this instruction causes OX to change from 
Oto 1, FX is set from (FRB)35 and not by the usual 
rule that FX is set to 1 when an exception bit 
changes from O to 1). Bits 1 and 2 (FEX and VX) 
are set according to the usual rule, given on page 
83, and not from (FRB)33:34- 
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Move To FPSCR Bit 0 X-form 


mtísbO BT (Rc=0) 
mtfsbO. BT = 


63 BT Il 11/ 70 Re 
0 6 11 16 21 31 


Bit BT of the FPSCR is set to 0. 


Special Registers Altered: 
FPSCR bit BT 
CR1 (if Rc=1) 


Programming Note 


Bits 1 and 2 (FEX and VX) cannot be explicitly 
reset. 


Move To FPSCR Bit 1 X-form 


mtfsb1 BT (Rc=0) 
mtfsb1. BT (Rc-1) 


63 BT Ill 11/ 38 Re 
0 6 11 16 21 31 


Bit BT of the FPSCR is set to 1. 


Special Registers Altered: 
FPSCR bits BT and FX 
CR1 (if Rc=1) 


Programming Note 
Bits 1 and 2 (FEX and VX) cannot be explicitly set. 
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5.1 Fixed-Point Processor Instructions 


5.1.1 Move To/From System Register Instructions 


The optional versions of the Move To Condition Reg- 
ister Field and Move From Condition Register 
instructions move to or from a single CR field. 


Move To Condition Register Field 
XFX-form 


mtcrf FXM,RS 
0 6 11112 20/21 31 
count € 0 
do i = 0 to 7 
if FXM, = 1 then 
nti 


count € count + 1 
if count = 1 then CRaxn:axne3 € (RS) 32,4xn:3244xn+3 
else CR € undefined 


If exactly one bit of the FXM field is set to 1, let n be 
the position of that bit in the field (0 < n x 7). The 
contents of bits 32+4xn:32+4xn+3 of register RS 
are placed into CR field n (CR bits 4xn:4xn+3). Oth- 
erwise, the contents of the Condition Register are 
undefined. 


Special Registers Altered: 
CR field selected by FXM 


Programming Note 


These forms of the mterf and mfer instructions are 
intended to replace the old forms of the 
instructions (the forms shown in Section 3.3.13), 
which will eventually be phased out of the archi- 
tecture. The new forms are backward compatible 
with most processors that comply with versions of 
the architecture that precede Version 2.00. On 
those processors, the new forms are treated as 
the old forms. 


However, on some processors that comply with 
versions of the architecture that precede Version 
2.00 the new forms may be treated as follows: 


mtcrf: may cause the system illegal instruction 
error handler to be invoked 

mfcr: may copy the contents of an SPR, possibly 
a privileged SPR, into register RT 


Assembler Note 


There is no direct way for the programmer to 
specify whether the Assembler should generate 
the old forms of these instructions or the new 
forms. The Assembler should determine which 
form to generate based on the target machine, as 
well as on how the instruction is coded (i.e., 
whether an FXM field is given for mfcr and, for 
both instructions, whether the FXM field has 
exactly one bit set to 1). 


Move From Condition Register 
XFX-form 


mfcr RT,FXM 


| 
0 6 11112 20/21 31 


RT € undefined 


count € 0 
do i = 0 to 7 
if FXM, = 1 then 
n¢i 


, count ¢ count + 1 
if count = 1 then RT3244xn:32+4xn+3 € CR4xn:4xn+3 


If exactly one bit of the FXM field is set to 1, let n be 
the position of that bit in the field (0 € n x 7). The 
contents of CR field n (CR bits 4xn:4xn+3) are placed 
into bits 32+4xn:32+4xn+3 of register RT and the 
contents of the remaining bits of register RT are 
undefined. Otherwise, the contents of register RT are 
undefined. 


Special Registers Altered: 
None 


Engineering Note 


These forms of the mtcrf and mfcr instructions are 
being phased into the architecture, and must be 
implemented in processors that comply with 
Version 2.00 of the architecture specification or 
with any subsequent version. 


Architecture Note 


The processors for which the new forms of these 
instructions are not treated as the old forms are 
as follows: 


mtcrf: versions of the 630 processor that predate 
630 SOI (Illegal Instruction type Program 
interrupt) 

mfcr: Northstar processors (incorrect results) 


When the performance of systems based on these 
processors is less important than the performance 
of newer systems, the new forms of the 
instructions can be moved into the architecture 
proper. After that time, it is expected that 
systems based on Northstar processors can be 
configured to generate a Program interrupt when 
the new form of mfcr is executed. If this expecta- 
tion is met, the new forms of the instructions will 
generate a Program interrupt on all processors for 
which they are treated neither as the old forms 
nor as the new forms, and operating systems on 
the affected systems would be expected to 
emulate the new forms. 
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5.2 Floating-Point Processor Instructions 


The optional instructions described in this section are = General Purpose group: fsqrt, fsqrts 
divided into two groups. Additional groups may be m Graphics group: fres, frsqrte, fsel 


defined in the future. : : : : 
An implementation that claims to support a given 
group implements all the instructions in the group. 
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5.2.1 Floating-Point Arithmetic Instructions 


5.2.1.1 Floating-Point Elementary Arithmetic Instructions 


Floating Square Root [Single] A-form 


fsqrt FRT,FRB (Rc=0) 
fsqrt. FRT,FRB (Rc=1) 

63 FRT Ill FRB Ill 22 |Rc 
0 6 11 16 21 26 31 
fsqrts FRT,FRB (Rc=0) 
fsqrts. FRT,FRB (Rc=1) 


59 FRT IT] FRB Il 22 |Rc 
0 6 11 16 21 26 31 


The square root of the floating-point operand in reg- 
ister FRB is placed into register FRT. 


If the most significant bit of the resultant significand is 
not 1, the result is normalized. The result is rounded 
to the target precision under control of the Floating- 
Point Rounding Control field RN of the FPSCR and 
placed into register FRT. 


Operation with various special values of the operand 
is summarized below. 


Operand Result Exception 
—oo QNaN! VXSQRT 
«0 QNan! VXSORT 

—0 — None 

+oo +oo None 

SNaN ONaNÍ VXSNAN 
ONaN QNaN None 


TNo result if FPSCRyg = 1. 


FPSCRrppF is set to the class and sign of the result, 


except for Invalid Operation Exceptions when 
FPSCRyg= 1. 
Special Registers Altered: 

FPRE FR Fl 

FX XX 

VXSNAN VXSQRT 

CR1 (if Rc=1) 


Floating Reciprocal Estimate Single 
A-form 


fres FRT,FRB (Rc=0) 
tres. FRT,FRB (Rc=1) 


59 FRT HH FRB Il 24 |Rc 
0 6 11 16 21 26 31 


A single-precision estimate of the reciprocal of the 
floating-point operand in register FRB is placed into 
register FRT. The estimate placed into register FRT 
is correct to a precision of one part in 256 of the 
reciprocal of (FRB), i.e., 


estimate — 1/x 1 
E e < PA 
aBs( 1/x ) ~ 256 


where x is the initial value in FRB. Note that the 
value placed into register FRT may vary between 
implementations, and between different executions on 
the same implementation. 


Operation with various special values of the operand 
is summarized below. 


Operand Result Exception 
—oo - None 

-0 —od ZX 

+0 +od ZX 

+oo +0 None 

SNaN QNaN? VXSNAN 
QNaN QNaN None 


'No result if FPSCRzg = 1. 
No result if FPSCRyg = 1. 


FPSCRrprç is set to the class and sign of the result, 


except for Invalid Operation Exceptions when 
FPSCRye=1 and Zero Divide Exceptions when 
FPSCRzg-1. 


Special Registers Altered: 
FPRF FR (undefined) FI (undefined) 
FX OX UX ZX 
VXSNAN 
CR1 (if Rc=1) 


Architecture Note 


No double-precision version of this instruction is 
provided because graphics applications are 


expected to need only the single-precision 
version, and no other important performance- 
critical applications are expected to need a 
double-precision version. 
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Floating Reciprocal Square Root 
Estimate A-form 


FRT,FRB 
FRT,FRB 


frsqrte 
frsqrte. 


63 FRT HH FRB Il 26 |Rc 
0 6 11 16 21 26 31 


A double-precision estimate of the reciprocal of the 
square root of the floating-point operand in register 
FRB is placed into register FRT. The estimate placed 
into register FRT is correct to a precision of one part 
in 32 of the reciprocal of the square root of (FRB), i.e., 


estimate - 1/V x 1 
ABS — < => 
1 x 32 


where x is the initial value in FRB. Note that the 
value placed into register FRT may vary between 
implementations, and between different executions on 
the same implementation. 


Operation with various special values of the operand 
is summarized below. 


Operand Result Exception 
—oo QNaN? VXSORT 
<0 QNaN? VXSQRT 

-0 ~od ZX 

+0 +od 2X 

too +0 None 

SNaN QNaN? VXSNAN 
QNaN QNaN None 


'No result if FPSCRzg = 1. 
?No result if FPSCRye = 1. 


FPSCRrprç is set to the class and sign of the result, 


except for Invalid Operation Exceptions when 
FPSCRye=1 and Zero Divide Exceptions when 
FPSCRzg-1. 


Special Registers Altered: 
FPRF FR (undefined) FI (undefined) 
FX ZX 
VXSNAN VXSQRT 
CR1 (if Rc=1) 


Architecture Note 


No single-precision version of this instruction is 
provided because it would be superfluous: if (FRB) 


is representable in single format, then so is (FRT). 


5.2.2 Floating-Point Select 
Instruction 


Floating Select A-form 


fsel FRT,FRA,FRC,FRB 
fsel. FRT,FRA,FRC,FRB 


63 FRT FRA FRB FRC 23 |Rc 
0 6 11 16 21 26 31 


if (FRA) 2 0.0 then FRT + (FRC) 
else FRT € (FRB) 


The floating-point operand in register FRA is com- 
pared to the value zero. If the operand is greater 
than or equal to zero, register FRT is set to the con- 
tents of register FRC. If the operand is less than zero 
or is a NaN, register FRT is set to the contents of reg- 
ister FRB. The comparison ignores the sign of zero 
(i.e., regards +0 as equal to — 0). 


Special Registers Altered: 
CR1 (if Rc=1) 


Architecture Note 


The Select instruction is similar to a Move instruc- 
tion, and therefore does not alter the FPSCR. 


Programming Note 


Examples of uses of this instruction can be found 
in Sections C.2, “Floating-Point Conversions" on 
page 158 and C.3, “Floating-Point Selection" on 
page 160. 


Warning: Care must be taken in using fsel if IEEE 
compatibility is required, or if the values being 
tested can be NaNs or infinities; see Section C.3.4, 
"Notes" on page 160. 
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5.3 Little-Endian 


It is computed that eleven Thousand Persons have, at several Times, suffered Death, rather than submit 
to break their Eggs at the smaller End. Many hundred large Volumes have been published upon this 


Controversy .... 


The Little-Endian facility permits a program to access 
storage using Little-Endian byte ordering. 


5.3.1 Byte Ordering 


If scalars (individual data items and instructions) were 
indivisible, then there would be no such concept as 
“byte ordering”. It is meaningless to talk of the 
“order” of bits or groups of bits within the smallest 
addressable unit of storage, because nothing can be 
observed about such order. Only when scalars, which 
the programmer and processor regard as indivisible 
quantities, can be made up of more than one address- 
able unit of storage does the question of “order” 
arise. 


For a machine in which the smallest addressable unit 
of storage is the 64-bit doubleword, there is no ques- 
tion of the ordering of “bytes” within doublewords. 
All transfers of individual scalars to and from storage 
(e.g., between registers and storage) are of 
doublewords, and the address of the “byte” con- 
taining the high-order 8 bits of a scalar is no different 
from the address of a “byte” containing any other 
part of the scalar. 


For PowerPC AS, as for most computers currently 
available, the smallest addressable unit of storage is 
the 8-bit byte. Many scalars are halfwords, words, or 
doublewords, which consist of groups of bytes. When 
a word-length scalar is moved from a register to 
storage, the scalar occupies four consecutive byte 
addresses. It thus becomes meaningful to discuss the 
order of the byte addresses with respect to the value 
of the scalar: which byte contains the highest-order 8 
bits of the scalar, which byte contains the next- 
highest-order 8 bits, and so on. 


Given a scalar that spans multiple bytes, the choice of 
byte ordering is essentially arbitrary. There are 
4!= 24 ways to specify the ordering of four bytes 
within a word, but only two of these orderings are 
sensible: 


m The ordering that assigns the lowest address to 
the highest-order (“leftmost”) 8 bits of the scalar, 


Jonathan Swift, Gulliver's Travels 


the next sequential address to the next-highest- 
order 8 bits, and so on. This is called Big-Endian 
because the “big end” of the scalar, considered 
as a binary number, comes first in storage. IBM 
RISC System/6000, IBM  System/370, and 
Motorola 680x0 are examples of computers using 
this byte ordering. 


m The ordering that assigns the lowest address to 
the lowest-order (“rightmost”) 8 bits of the scalar, 
the next sequential address to the next-lowest- 
order 8 bits, and so on. This is called Little- 
Endian because the "little end" of the scalar, 
considered as a binary number, comes first in 
storage. DEC VAX and Intel x86 are examples of 
computers using this byte ordering. 


5.3.2 Structure Mapping Examples 


Figure 39 on page 125 shows an example of a C lan- 
guage structure s containing an assortment of scalars 
and one character string. The value assumed to be in 
each structure element is shown in hex in the C com- 
ments; these values are used below to show how the 
bytes making up each structure element are mapped 
into storage. 


C structure mapping rules permit the use of padding 
(skipped bytes) in order to align the scalars on desir- 
able boundaries. Figures 40 and 41 show each scalar 
aligned at its natural boundary. This alignment intro- 
duces padding of four bytes between a and b, one 
byte between d and e, and two bytes between e and f. 
The same amount of padding is present for both Big- 
Endian and Little-Endian mappings. 


5.3.2.1 Big-Endian Mapping 


The Big-Endian mapping of structure s is shown in 
Figure 40. Addresses are shown in hex at the left of 
each doubleword, and in small figures below each 
byte. The contents of each byte, as indicated in the C 
example in Figure 39, are shown in hex (as characters 
for the elements of the string). 
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struct ( 
int a; /* 0x1112 1314 word */ 
double b; /* 0x2122 2324 2526 2728 doubleword */ 
char * c; /* 0x3132 3334 word */ 
char d[7]; /* ‘A’, 'B', 'C', 'D', E' 'F', 'G' array of bytes */ 
short e; /* 0x5152 halfword */ 
int f; /* 0x6162 6364 word hi, 

s 


Figure 39. C structure 's', showing values of elements 


20 


Figure 40. Big-Endian mapping of structure 's' 


5.3.2.2 Little-Endian Mapping 


The same structure s is shown mapped Little-Endian 
in Figure 41. Doublewords are shown laid out from 
right to left, which is the common way of showing 
storage maps for Little-Endian machines. 


'Y' E! 18 


Figure 41. Little-Endian mapping of structure 's' 


5.3.3 PowerPC AS Byte Ordering 


The body of each of the three PowerPC AS Architec- 
ture Books, Book l, PowerPC AS User Instruction Set 
Architecture, Book ll, PowerPC AS Virtual Environ- 
ment Architecture, and Book lll, PowerPC AS Oper- 
ating Environment Architecture, is written as if a 
PowerPC AS system runs only in Big-Endian mode. In 
fact, a PowerPC AS system can instead run in Little- 
Endian mode, in which the instruction set behaves as 
if the byte ordering were Little-Endian, and can 
change Endian mode dynamically. The remainder of 
Section 5.3 describes how the mode is controlled, and 


how running in Little-Endian mode differs from 
running in Big-Endian mode. 


5.3.3.1 Controlling PowerPC AS Byte 
Ordering 


The Endian mode of a PowerPC AS processor is con- 
trolled by two bits: the LE (Little-Endian Mode) bit 
specifies the current mode of the processor, and the 
ILE (Interrupt Little-Endian Mode) bit specifies the 
mode that the processor enters when the system 
error handler is invoked. For both bits, a value of O 
specifies Big-Endian mode and a value of 1 specifies 
Little-Endian mode. The location of these bits and the 
requirements for altering them are described in Book 
Ill, PowerPC AS Operating Environment Architecture. 


When a PowerPC AS system comes up after power- 
on-reset, Big-Endian mode is in effect (see Book Ill). 
Thereafter, methods described in Book Ill can be used 
to change the mode, as can both invoking the system 
error handler and returning from the system error 
handler. 


Programming Note 


For a discussion of software synchronization 
requirements when altering the LE and ILE bits, 
see Book Ill (e.g., to the chapter entitled “Syn- 
chronization Requirements for Special Registers 
and for Lookaside Buffers” in Book Ill). 


Architecture Note 


The LE and ILE bits must be defined in Book III in 
a manner such that they can be changed dynam- 
ically and that the LE bit can easily be treated as 
part of a process' state. 


5.3.3.2 PowerPC AS Little-Endian Byte 
Ordering 


One might expect that a PowerPC AS system running 
in Little-Endian mode would have to perform a 2-way, 
4-way, or 8-way byte swap when transferring a 
halfword, word, or doubleword to or from storage, 
e.g., when transferring data between storage and a 
General Purpose Register or Floating-Point Register, 
when fetching instructions, and when transferring data 
between storage and an Input/Output (I/O) device. 
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PowerPC AS systems do not do such swapping, but 
instead achieve the effect of Little-Endian byte 
ordering by modifying the low-order three bits of the 
effective address (EA) as described below. Individual 
scalars actually appear in storage in Big-Endian byte 
order. 


The modification affects only the addresses presented 
to the storage subsystem (see Book Ill, PowerPC AS 
Operating Environment Architecture). All effective 
addresses in architecturally defined registers, as well 
as the Current Instruction Address (CIA) and Next 
Instruction Address (NIA), are independent of Endian 
mode. For example: 


m The effective address placed into the Link Reg- 
ister by a Branch instruction with LK=1 is equal 
to the CIA of the Branch instruction + 4. 


m The effective address placed into RA by a 
Load/Store with Update instruction is the value 
computed as described in the instruction 
description. 


m The effective addresses placed into System Reg- 
isters when the system error handler is invoked 
(e.g. SRRO, DAR; see Book Ill, PowerPC AS 
Operating Environment Architecture) are those 
that were computed or would have been com- 
puted by the interrupted program. 


Architecture Note 


In fact, the modification is performed on the real 
address (see Book lll, PowerPC AS Operating 
Environment Architecture), and not on the effec- 
tive address at all. Describing the modification 


this way makes it obvious why all effective 
addresses in architecturally defined registers, and 
in the CIA and NIA, are unaffected. However, this 
simple description cannot be used here, because 
real addresses are not defined in Book I. 


The modification is performed regardless of whether 
address translation is enabled or disabled and, if 
address translation is enabled, regardless of the 
translation mechanism used (see Book lll, PowerPC 
AS Operating Environment Architecture). The actual 
transfer of data and instructions to and from storage 
is unaffected (and thus unencumbered by multiplexors 
for byte swapping). 


The modification of the low-order three bits of the 
effective address in Little-Endian mode is done as 
follows, for access to an individual aligned scalar. 
(Alignment is as determined before this modification.) 
Access to an individual unaligned scalar or to multiple 
scalars is described in subsequent sections, as is 
access to certain architecturally defined data in 
storage, data in caches (e.g., see Book Il, PowerPC 
AS Virtual Environment Architecture, and Book lll, 
PowerPC AS Operating Environment Architecture), 
etc. 


In Little-Endian mode, the effective address is com- 
puted in the same way as in Big-Endian mode. Then, 
in Little-Endian mode only, the low-order three bits of 
the effective address are Exclusive ORed with a 
three-bit value that depends on the length of the 
operand (1, 2, 4, or 8 bytes), as shown in Table 2. 
This modified effective address is then presented to 
the storage subsystem, and data of the specified 
length are transferred to or from the addressed (as 
modified) storage locations(s). 


Data Length (bytes) EA Modification 

1 XOR with 0b1 11 

XOR with 0b110 

XOR with 0b100 
(no change) 


Table 2. PowerPC AS Little-Endian, effective address 
modification for individual aligned scalars 


The effective address modification makes it appear to 
the processor that individual aligned scalars are 
stored Little-Endian, while in fact they are stored Big- 
Endian but in different bytes within doublewords from 
the order in which they are stored in Big-Endian 
mode. 


For example, in Little-Endian mode structure s would 
be placed in storage as follows, from the point of view 
of the storage subsystem (i.e., after the effective 
address modification described above). 


00 
08 


10 'D' IC! 


10 11 
18 Gr CE! CE! 


20 


Figure 42. PowerPC AS Little-Endian, structure 's' in 
storage subsystem 


Figure 42 is identical to Figure 41 except that the byte 
numbers within each doubleword are reversed. (This 
identity is in some sense an artifact of depicting 
storage as a sequence of doublewords. If storage is 
instead depicted as a sequence of words, a single 
byte stream, etc., then no such identity appears. 
However, regardless of the unit in which storage is 
depicted or accessed, the address of a given byte in 
Figure 42 differs from the address of the same byte 
in Figure 41 only in the low-order three bits, and the 
sum of the two 3-bit values that comprise the low- 
order three bits of the two addresses is equal to 7. 
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Depicting storage as a sequence of doublewords 
makes this relationship easy to see.) 


Because of the modification performed on effective 
addresses, structure s appears to the processor to be 
mapped into storage as follows when the processor is 
in Little-Endian mode. 


Figure 43. PowerPC AS Little-Endian, structure 's' as 
seen by processor 


Notice that, as seen by the program executing in the 
processor, the mapping for structure s is identical to 
the Little-Endian mapping shown in Figure 41. From a 
point of view outside the processor, however, the 
addresses of the bytes making up structure s are as 
shown in Figure 42. These addresses match neither 
the Big-Endian mapping of Figure 40 nor the Little- 
Endian mapping of Figure 41; allowance must be 
made for this in certain circumstances (e.g., when 
performing I/O; see Section 5.3.7). 


The following four sections describe in greater detail 
the effects of running in Little-Endian mode on 
accessing data, on fetching instructions, on explicitly 
accessing the caches and any address translation 
lookaside buffers (e.g., see Book Il, PowerPC AS 
Virtual Environment Architecture, and Book Ill, 
PowerPC AS Operating Environment Architecture), 
and on doing I/O. 


Architecture Note 


The capability of running in Little-Endian mode is 
provided in order to facilitate porting Little-Endian 


application programs and operating systems to 
PowerPC AS systems. 


5.3.4 PowerPC AS Data Addressing 
in Little-Endian Mode 


5.3.4.1 Individual Aligned Scalars 


When the storage operand is aligned for any instruc- 
tion in the following classes, the effective address 
presented to the storage subsystem is computed as 
described in Section 5.3.3.2: Fixed-Point Load, Fixed- 
Point Store, Load and Store with Byte Reversal, 
Floating-Point Load, Floating-Point Store (including 
stfiwx), and Load And Reserve and Store Conditional 
(see Book Il). 


The Load and Store with Byte Reversal instructions 
have the effect of loading or storing data in the oppo- 
site Endian mode from that in which the processor is 
running. That is, data are loaded or stored in Little- 
Endian order if the processor is running in Big-Endian 
mode, and in Big-Endian order if the processor is 
running in Little-Endian mode. 


5.3.4.2 Other Scalars 


As described below, the system alignment error 
handler may be (see the subsection entitled “Indi- 
vidual Unaligned Scalars”) or is (see the subsection 
entitled “Multiple Scalars”) invoked if attempt is made 
in Little-Endian mode to execute any of the 
instructions described in the following two sub- 
sections. 


Individual Unaligned Scalars 


The “trick” of Exclusive ORing the low-order three bits 
of the effective address of an individual scalar does 
not work unless the scalar is aligned. In Little-Endian 
mode, PowerPC AS processors may cause the system 
alignment error handler to be invoked whenever any 
of the Load or Store instructions listed in Section 
5.3.4.1 is issued with an unaligned effective address, 
regardless of whether such an access could be 
handled without invoking the system alignment error 
handler in Big-Endian mode. 


PowerPC AS processors are not required to invoke 
the system alignment error handler when an una- 
ligned access is attempted in Little-Endian mode. The 
implementation may handle some or all such 
accesses without invoking the system alignment error 
handler, just as in Big-Endian mode. The architectural 
requirement is that  halfwords, words, and 
doublewords be placed in storage such that the Little- 
Endian effective address of the lowest-order byte is 
the effective address computed by the Load or Store 
instruction, the Little-Endian address of the next- 
lowest-order byte is one greater, and so on. (Load 
And Reserve and Store Conditional differ somewhat 
from the rest of the instructions listed in Section 
5.3.4.1, in that neither the implementation nor the 
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system alignment error handler is expected to handle 
these four instructions “correctly” if their operands 
are not aligned.) 


Figure 44 shows an example of a word w stored at 
Little-Endian address 5. The word is assumed to 
contain the binary value 0x1112 1314. 


12 13 14 00 


11 08 
OF 0E OD OC 0B 0A 09 | 08 


Figure 44. Little-Endian mapping of word 'w' stored 
at address 5 


In Little-Endian mode word w would be placed in 
storage as follows, from the point of view of the 
storage subsystem (i.e., after the effective address 
modification described in Section 5.3.3.2). 


Figure 45. PowerPC AS Little-Endian, word 'w' stored 
at address 5 in storage subsystem 


Notice that the unaligned word w in Figure 45 spans 
two doublewords. The two parts of the unaligned 
word are not contiguous as seen by the storage sub- 
System. 


An implementation may choose to support some but 
not all unaligned Little-Endian accesses. For example, 
an unaligned Little-Endian access that is contained 
within a single doubleword may be supported, while 
one that spans doublewords may cause the system 
alignment error handler to be invoked. 


Multiple Scalars 


PowerPC AS has two classes of instructions that 
handle multiple scalars, namely the Load and Store 
Multiple instructions and the Move Assist instructions. 
Because both classes of instructions potentially deal 
with more than one word-length scalar, neither class 
is amenable to the effective address modification 
described in Section 5.3.3.2 (e.g., pairs of aligned 
words would be accessed in reverse order from what 
the program would expect). Attempting to execute 
any of these instructions in Little-Endian mode causes 
the system alignment error handler to be invoked. 


128 PowerPC AS User Instruction Set Architecture, Tags Inactive Subset 


5.3.4.3 Page Table 


The layout of the Page Table in storage (see Book III, 
PowerPC AS Operating Environment Architecture) is 
independent of Endian mode. A given byte in the 
Page Table must be accessed using an effective 
address appropriate to the mode of the executing 
program (e.g., the high-order byte of a Page Table 
Entry must be accessed with an effective address 
ending with 0b000 in Big-Endian mode, and with an 
effective address ending with 0b111 in Little-Endian 
mode). 


Engineering Note 


An implementation that uses software assistance 
to facilitate the hardware's searching and alter- 
ation of the Page Table must supply two separate 
software routines, one for Big-Endian mode and 
one for Little-Endian mode. 


5.3.5 PowerPC AS Instruction 
Addressing in Little-Endian Mode 


Each PowerPC AS instruction occupies an aligned 
word in storage. The processor fetches and executes 
instructions as if the CIA were advanced by four for 
each sequentially fetched instruction. When the 
processor is in Little-Endian mode, the effective 
address presented to the storage subsystem in order 
to fetch an instruction is the value from the CIA, mod- 
ified as described in Section 5.3.3.2 for aligned word- 
length scalars. A Little-Endian program is thus an 
array of aligned Little-Endian words, with each word 
fetched and executed in order (discounting branches 
and invocations of the system error handler). 


Figure 46 shows an example of a small assembly lan- 
guage program p. 


loop: 
cmplwi 15,0 
beq done 
lwzux r4,r5,r6 
add r7,r7,r4 
subi r5,15,4 
b loop 
done: 
stw ri, total 


Figure 46. Assembly language program 'p' 


The Big-Endian mapping for program p is shown in 
Figure 47 (assuming the program starts at address 0). 
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00 loop: cmplwi r5,0 beg done 
o 01 02 03 | 04 05 06 07 
08 lwzux r4,r5,r6 add r7,r7,r4 
08 09 OA OB 
10 subi r5,r5,4 


10 11 
18 done: stw r7, total 
19 1B 


Figure 47. Big-Endian mapping of program 'p' 


The same program p is shown mapped Little-Endian 
in Figure 48. 


UE r7,r7,r4 rS r4, = oF 08 


NU VUE 10 
11 
done: stw r7,total 18 
1B 19 18 


Figure 48. Little-Endian mapping of program 'p' 


In Little-Endian mode program p would be placed in 
storage as follows, from the point of view of the 
Storage subsystem (i.e., after the effective address 
modification described in Section 5.3.3.2). 


00 beq done loop: cmplwi r5,0 
o 01 02 03 | 04 05 06 07 

08 add r7,r7,r4 lwzux r4,r5,r6 
0C OD OE OF 

10 subi r5,r5,4 
16 17 
18 done: stw r7,total 
1D 1F 


Figure 49. PowerPC AS Little-Endian, program 'p' in 
storage subsystem 


Figure 49 is identical to Figure 48 except that the 
byte numbers within each doubleword are reversed. 
(This identity is in some sense an artifact of depicting 
storage as a sequence of doublewords. If storage is 
instead depicted as a sequence of words, a single 
byte stream, etc., then no such identity appears. 
However, regardless of the unit in which storage is 
depicted or accessed, the address of a given byte in 
Figure 49 differs from the address of the same byte 
in Figure 48 only in the low-order three bits, and the 
sum of the two 3-bit values that comprise the low- 
order three bits of the two addresses is equal to 7. 
Depicting storage as a sequence of doublewords 
makes this relationship easy to see.) 


Each individual machine instruction appears in 
storage as a 32-bit integer containing the value 
described in the instruction description, regardless of 
the Endian mode. This is a consequence of the fact 
that individual aligned scalars are mapped in storage 
in Big-Endian byte order. 


Notice that, as seen by the processor when executing 
program p, the mapping for program p is identical to 
the Little-Endian mapping shown in Figure 48. From a 
point of view outside the processor, however, the 
addresses of the bytes making up program p are as 
shown in Figure 49. These addresses match neither 
the Big-Endian mapping of Figure 47 nor the Little- 
Endian mapping of Figure 48. 


All instruction effective addresses visible to an exe- 
cuting program are the effective addresses that are 
computed by that program or, in the case of the 
System error handler, effective addresses that were 
or could have been computed by the interrupted 
program. These effective addresses are independent 
of Endian mode. Examples for Little-Endian mode 
include the following. 


m An instruction address placed into the Link Reg- 
ister by a Branch instruction with LK=1, or an 
instruction address saved in a System Register 
when the system error handler is invoked, is the 
effective address that a program executing in 
Little-Endian mode would use to access the 
instruction as a data word using a Load instruc- 
tion. 


m An offset in a relative Branch instruction (Branch 
or Branch Conditional with AA=0) reflects the 
difference between the addresses of the branch 
and target instructions, using the addresses that 
a program executing in Little-Endian mode would 
use to access the instructions as data words 
using Load instructions. 


m A target address in an absolute Branch instruc- 
tion (Branch or Branch Conditional with AA=1) is 
the address that a program executing in Little- 
Endian mode would use to access the target 
instruction as a data word using a Load instruc- 
tion. 


m The storage locations that contain the first set of 
instructions executed by each kind of system 
error handler must be set in a manner consistent 
with the Endian mode in which the system error 
handler will be invoked. (These sets of 
instructions occupy architecturally defined 
locations; see Book Ill, PowerPC AS Operating 
Environment Architecture.) Thus if the system 
error handler is to be invoked in Little-Endian 
mode, the first set of instructions for each kind of 
system error handler must appear in storage, 
from the point of view of the storage subsystem 
(i.e., after the effective address modification 
described in Section 5.3.3.2), with the pair of 
instructions within each doubleword reversed 
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from the order in which they are to be executed. 
(If the instructions are placed into storage by a 
program running in the same Endian mode as 
that in which the system error handler will be 
invoked, the appropriate order will be achieved 
naturally.) 


Programming Note 


In general, a given subroutine in storage cannot 
be shared between programs running in different 
Endian modes. This affects the sharing of subrou- 
tine libraries. 


Engineering Note 


If the Endian mode changes because an sc, Trap, 
or rfid (see Book IIl) instruction was executed or 
because an interrupt occurred, subsequent 
instructions must be executed in the correct order 
as determined by the new Endian mode (MSR,g) 
regardless of the Endian mode that was in effect 
when the instructions were fetched into the 
instruction cache. Implementations that condi- 
tionally reverse the order of instructions within 
doublewords depending on the current Endian 
mode when placing instructions into the instruc- 
tion cache must correct the instruction order 
when the Endian mode is changed by the occur- 
rences listed at the beginning of this Note. 
However, restrictions may apply when the Endian 
mode is changed by the execution of an mtmsr{ d] 
instruction; e.g., see the chapter entitled "Syn- 
chronization Requirements for Special Registers 
and for Lookaside Buffers" in Book III. 


5.3.6 PowerPC AS Cache 
Management Instructions in 
Little-Endian Mode 


Instructions for explicitly accessing the caches (see 
Book Il, PowerPC AS Virtual Environment 
Architecture) are unaffected by Endian mode. (Iden- 
tification of the block to be accessed is not affected 
by the low-order three bits of the effective address.) 


5.3.7 PowerPC AS 1/0 in 
Little-Endian Mode 


Input/output (I/O), such as writing the contents of a 
large area of storage to disk, transfers a byte stream 
on both Big-Endian and Little-Endian systems. For the 
disk transfer, the first byte of the area is written to 
the first byte of the disk record and so on. 


For a PowerPC AS system running in Big-Endian 
mode, I/O transfers happen “naturally” because the 


byte that the processor sees as byte O is the same 
one that the storage subsystem sees as byte 0. 


For a PowerPC AS system running in Little-Endian 
mode this is not the case, because of the modification 
of the low-order three bits of the effective address 
when the processor accesses storage. In order for 
I/O transfers to transfer byte streams properly, in 
Little-Endian mode l/O transfers must be performed 
as if the bytes transferred were accessed one byte at 
a time, using the address modification described in 
Section 5.3.3.2 for single-byte scalars. This does not 
mean that I/O on Little-Endian PowerPC AS systems 
must use only 1-byte-wide transfers; data transfers 
can be as wide as desired, but the order of the bytes 
transferred within doublewords must appear as if the 
bytes were fetched or stored one byte at a time. See 
the System Architecture documentation for a given 
PowerPC AS system for details on the transfer width 
and byte ordering on that system. 


However, not all I/O done on PowerPC AS systems is 
for large areas of storage as described above. 1/0 
can be performed with certain devices merely by 
storing to or loading from addresses that are associ- 
ated with the devices (the terms “memory-mapped 
1/0” and "programmed 1/0” or “PIO” are used for 
this). For such PIO transfers, care must be taken 
when defining the addresses to be used, for these 
addresses are subject to the effective address modifi- 
cation shown in Table 2 on page 126. A Load or 
Store instruction that maps to a control register on a 
device may require that the value loaded or stored 
have its bytes reversed; if this is required, the Load 
and Store with Byte Reversal instructions can be 
used. Any requirement for such byte reversal for a 
particular l/O device register is independent of 
whether the PowerPC AS system is running in Big- 
Endian or Little-Endian mode. 


Similarly, the address sent to an I/O device by an 
eciwx or ecowx instruction (see Book Il, PowerPC AS 
Virtual Environment Architecture) is subject to the 
effective address modification shown in Table 2. 


5.3.8 Origin of Endian 


The terms Big-Endian and Little-Endian come from 
Part |, Chapter 4, of Jonathan Swift's Gulliver's 
Travels. Here is the complete passage, from the 
edition printed in 1734 by George Faulkner in Dublin. 


... our Histories of six Thousand Moons make 
no Mention of any other Regions, than the 
two great Empires of Lilliput and Blefuscu. 
Which two mighty Powers have, as | was 
going to tell you, been engaged in a most 
obstinate War for six and thirty Moons past. 
It began upon the following Occasion. |t is 
allowed on all Hands, that the primitive Way 
of breaking Eggs before we eat them, was 
upon the larger End: But his present Majes- 
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ty's Grand-father, while he was a Boy, going 
to eat an Egg, and breaking it according to 
the ancient Practice, happened to cut one of 
his Fingers. Whereupon the Emperor his 
Father, published an Edict, commanding all 
his Subjects, upon great Penalties, to break 
the smaller End of their Eggs. The People so 
highly resented this Law, that our Histories 
tell us, there have been six Rebellions raised 
on that Account; wherein one Emperor lost 
his Life, and another his Crown. These civil 
Commotions were constantly fomented by the 
Monarchs of Blefuscu; and when they were 
quelled, the Exiles always fled for Refuge to 
that Empire. It is computed that eleven 
Thousand Persons have, at several Times, 
suffered Death, rather than submit to break 
their Eggs at the smaller End. Many hundred 
large Volumes have been published upon this 
Controversy: But the Books of the Big- 
Endians have been long forbidden, and the 
whole Party rendered incapable by Law of 
holding Employments. During the Course of 
these Troubles, the Emperors of Blefuscu did 
frequently expostulate by their Ambassadors, 
accusing us of making a Schism in Religion, 
by offending against a fundamental Doctrine 
of our great Prophet Lustrog, in the fifty- 
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fourth Chapter of the Brundrecal, (which is 
their Alcoran.) This, however, is thought to 
be a mere Strain upon the text: For the 
Words are these; That all true Believers shall 
break their Eggs at the convenient End: and 
which is the convenient End, seems, in my 
humble Opinion, to be left to every Man's 
Conscience, or at least in the Power of the 
chief Magistrate to determine. Now the Big- 
Endian Exiles have found so much Credit in 
the Emperor of Blefuscu's Court; and so 
much private Assistance and Encouragement 
from their Party here at home, that a bloody 
War has been carried on between the two 
Empires for six and thirty Moons with various 
Success; during which Time we have lost 
Forty Capital Ships, and a much greater 
Number of smaller Vessels, together with 
thirty thousand of our best Seamen and Sol- 
diers; and the Damage received by the 
Enemy is reckoned to be somewhat greater 
than ours. However, they have now 
equipped a numerous Fleet, and are just pre- 
paring to make a Descent upon us: and his 
Imperial Majesty, placing great Confidence in 
your Valour and Strength, hath commanded 
me to lay this Account of his Affairs before 
you. 
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Appendix A. Suggested Floating-Point Models 


A.1 Floating-Point Round to Single-Precision Model 


The following describes algorithmically the operation of the Floating Round to Single-Precision instruction. 


If (FRB),44 < 897 and (FRB),.6 > 0 then 
Do 
If FPSCRye = 0 then goto Disabled Exponent Underflow 
If FPSCRye = 1 then goto Enabled Exponent Underflow 
End 


If (FRB)4-44 > 1150 and (FRB),44 < 2047 then 
Do 


If FPSCRog = 0 then goto Disabled Exponent Overflow 
If FPSCRog = 1 then goto Enabled Exponent Overflow 
End 


If (FRB),44 > 896 and (FRB),44 < 1151 then goto Normal Operand 
If (FRB)45 = 0 then goto Zero Operand 


If (FRB)444 = 2047 then 
Do 
If (FRB)40.63 = 0 then goto Infinity Operand 
If (FRB),> = 1 then goto QNaN Operand 
If (FRB)49 = 0 and (FRB)+3:63 > 0 then goto SNaN Operand 
End 
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Disabled Exponent Underflow: 


sign € (FRB)o 
If (FRB),44 = 0 then 
Do 


exp t — 1022 
fraCg:52 € 0b0 | (FRB) 12.63 
End 
If (FRB)444 > 0 then 
Do 
exp € (FRB),44 — 1023 
fraco.59 € 0b1 | (FRB)1243 
End 
Denormalize operand: 
G | R |X ¢ 0b000 
Do while exp < — 126 
exp € exp + 1 
fraco:s2 |G |R |X 0b0 |fraco:s52 | G | (R| X) 
d 


n 
FPSCRyx + (frac24:52 |G | R | X) > 0 
Round Single(sign,exp,fraco.5o,G,R,X) 
FPSCRxx + FPSCRxx | FPSCRE; 

If fraco;so = O then 


Do 
FRTo € sign 
If sign = Othen FPSCRepge € “+zero” 
If sign = 1 then FPSCRepge € “— zero” 
End 
If fracg.5o > O then 
Do 


If fracg = 1 then 
Do 
If sign = 0 then FPSCRepge € “+normal number” 
If sign = 1 then FPSCRepge + “- normal number” 
End 
If fracg = 0 then 
Do 
If sign = 0 then FPSCRepge € “+ denormalized number” 
If sign = 1 then FPSCRepge + "- denormalized number" 
End 
Normalize operand: 
Do while fracg = O 
exp € exp- 1 
fraCo:so + fracy.50 | 0b0 


FRTo € sign 
FRT 4-44 ¢ exp + 1023 
FRT i263 € frac,» 
End 
Done 
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Enabled Exponent Underflow: 


FPSCRux € 1 

sign € (FRB) 

If (FRB),44 = 0 then 
Do 


exp t — 1022 
fraCo:52 € 0b0 | (FRB)42:3 
nd 


If (FRB)4,44 > 0 then 
Do 


exp € (FRB),4, — 1023 
fra0gs2 € Ob1 | (FRB)125 
End 
Normalize operand: 
Do while fracy = 0 
exp € exp - 1 
fraCo:52 + frac; .52 | 0b0 
End 
Round Single(sign,exp,fracç.52,0,0,0) 
FPSCRxx € FPSCRxx | FPSCRE; 
exp € exp + 192 
FRTo € sign 
FRT4.4, € exp + 1023 
FRT 2:69 € fracy:5o 
If sign = 0 then FPSCReppF € “+normal number” 
If sign = 1 then FPSCRepge € “- normal number” 
Done 


Disabled Exponent Overflow: 


FPSCRox * 1 
If FPSCRay = 0b00 then /* Round to Nearest */ 
Do 
If (FRB)g = 0 then FRT + Ox7FFO 0000 0000 0000 
If (FRB)g = 1then FRT + OxFFFO 0000 0000 0000 
If (FRB) = 0 then FPSCReppr € “+ infinity” 
If (FRB)g = 1 then FPSCReprç € “= infinity" 
End 
If FPSCRgy = 0b01 then /* Round toward Zero */ 
Do 
If (FRB) = 0 then FRT € Ox47EF FFFF E000 0000 
If (FRB)o= 1then FRT + OxC7EF FFFF E000 0000 
If (FRB)o = 0 then FPSCRepge € *« normal number" 
If (FRB)y = 1 then FPSCRrprç € “- normal number" 
End 
If FPSCRay = 0b10 then /* Round toward «Infinity */ 
Do 
If (FRB)g = 0 then FRT + Ox7FFO 0000 0000 0000 
If (FRB) = 1 then FRT € OxC7EF FFFF E000 0000 
If (FRB)p = 0 then FPSCRepge € “+ infinity" 
If (FRB)o = 1 then FPSCRgpge € “- normal number" 
End 
If FPSCRay = 0b11 then /* Round toward - Infinity */ 
Do 
If (FRB) = 0 then FRT € Ox47EF FFFF E000 0000 
If (FRB)g = 1then FRT + OxFFFO 0000 0000 0000 
If (FRB)o = 0 then FPSCRrprç € "normal number" 
If (FRB)g = 1 then FPSCReprç € “infinity” 
End 
FPSCRgg € undefined 
FPSCRg, € 1 
FPSCRyx € 1 
Done 
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Enabled Exponent Overflow: 


sign € (FRB)o 

exp € (FRB),4; — 1023 

fracoso € 0b1 | (FRB)y2:63 

Round Single(sign,exp,fraco.55,0,0,0) 

FPSCRxx € FPSCRxx | FPSCRr, 
Enabled Overflow: 

FPSCRox € 1 

exp t exp — 192 

FRTo € sign 

FRT;.41 + exp + 1023 

FRT42:63 € frac? 


If sign = 0 then FPSCRfppF € “+normal number” 
If sign = 1 then FPSCRepge + "- normal number” 


Done 


Zero Operand: 


FRT « (FRB) 

If (FRB)y = 0 then FPSCRrepne € “+ zero” 
If (FRB)g = 1 then FPSCRrepnr € “- zero” 
FPSCRrg fı € 0b00 

Done 


Infinity Operand: 


FRT € (FRB) 

If (FRB)y = 0then FPSCRrppf € “+ infinity” 
If (FRB)g = 1 then FPSCReppF € “infinity” 
FPSCRep p, € 0b00 

Done 


QNaN Operand: 


FRT € (FRB)o:34 | 20 
FPSCRrppr € "QNaN" 
FPSCReg fı + 0b00 
Done 


SNaN Operand: 


If FPSCRyg = 0 then 
Do 
FRTo41 + (FRB)o.14 
FRT 4 1 
FRT 13:63 € (FRB) 13:34 | 20 
FPSCRepge € “QNaN” 
End 
FPSCRep pı € 0b00 
Done 
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Normal Operand: 


sign € (FRB)o 

exp € (FRB),41- 1023 

fracos2 € 0b1 | (FRB)42:63 

Round Single(sign,exp,fraco.55,0,0,0) 

FPSCRxx € FPSCRxx | FPSCRE; 

If exp > 127 and FPSCRgg = 0 then go to Disabled Exponent Overflow 
If exp > 127 and FPSCRo¢ = 1 then go to Enabled Overflow 
FRTo € sign 

FRT;41 ¢ exp + 1023 

FRT 2:69 € fracy.s 

If sign = 0 then FPSCReppF € “+ normal number” 

If sign = 1 then FPSCRepge + “— normal number” 

Done 


Round Single(sign, exp, fraco.50,G,R,X): 


inc € 0 

lsb € fracos 

gbit € fraco 

rbit € fracos 

xbit € (fraco6:52 IG IR \X)# 0 

If FPSCRgy = 0b00 then /* Round to Nearest */ 

Do /* comparisons ignore u bits */ 

If sign |Isb | gbit | rbit | xbit = Obuttuu then inc € 1 
If sign |Isb | gbit | rbit | xbit = Obu011u then inc € 1 
If sign | Isb | gbit | rbit | xbit = ObuO1u1 then inc € 1 


End 
If FPSCRan = 0b10 then /* Round toward + Infinity */ 
Do /* comparisons ignore u bits */ 


If sign |Isb | gbit | rbit | xbit = ObOutuu then inc ¢ 1 
If sign |Isb | gbit | rbit | xbit = Ob0uutu then inc € 1 
If sign | Isb | gbit | rbit | xbit = ObOuuut then inc + 1 


End 
If FPSCRgy = 0b11 then /* Round toward - Infinity */ 
Do /* comparisons ignore u bits */ 


If sign | Isb | gbit | rbit | xbit = Obtutuu then inc + 1 
If sign |Isb | gbit | rbit | xbit = Obtuutu then inc € 1 
If sign |Isb | gbit | rbit | xbit = Obtuuul then inc € 1 
End 
fraCg.o4 € fracg.o3 + inc 
If carry out = 1 then 
Do 
fraCo:23 + 0b1 | fraco. 
exp € exp + 1 
End 
fraCo4:50 ¢ 290 
FPSCRgg € inc 
FPSCRf; € gbit | rbit | xbit 
Return 
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A.2 Floating-Point Convert to Integer Model 


The following describes algorithmically the operation of the Floating Convert To Integer instructions. 


If Floating Convert To Integer Word then 
Do 
round mode + FPSCRry 
tgt precision € “32-bit integer” 
End 


If Floating Convert To Integer Word with round toward Zero then 
Do 
round mode € 0b01 
tgt precision € “32-bit integer” 
End 


If Floating Convert To Integer Doubleword then 
Do 
round mode + FPSCRay 
tgt precision € “64-bit integer” 
End 


If Floating Convert To Integer Doubleword with round toward Zero then 
Do 
round mode € 0b01 
tgt precision € “64-bit integer” 
End 


sign € (FRB)o 

If (FRB),41 = 2047 and (FRB),5.63 = 0 then goto Infinity Operand 
( 412. 2047 and (FRB), = 0 then goto SNaN Operand 

If (FRB),41 = 2047 and (FRB); = 1 then goto QNaN Operand 

If (FRB),44 > 1086 then goto Large Operand 


If (FRB),44 > O then exp € (FRB),4, — 1023 /* exp — bias */ 

If (FRB),44 = 0 then exp € — 1022 

If (FRB)444 > O then fraco:g € 0b01 | (FRB),5.63 | 10 /* normal; need leading 0 for later complement */ 
If (FRB),4, = 0 then fracos € 0b00 | (FRB),5:63 | 110 /* denormal */ 


gbit | rbit | xbit € Ob000 
Do i=1,63-exp /* do the loop 0 times if exp = 63 */ 

fracos | gbit | rbit | xbit € ObO | fracos | gbit | (rbit | xbit) 
End 


Round Integer(sign,fraco.g4,gbit,rbit,xbit,round_mode) 

If sign = 1 then fraco.g4 € ~ fraco: + 1 /* needed leading 0 for — 29^ < (FRB) < — 283 */ 

If tgt precision = “32-bit integer” and fraco.g4 > 231— 1 then goto Large Operand 

If tgt precision = “64-bit integer” and fraco.g4 > 263%- 1 then goto Large Operand 

If tgt_precision = “32-bit integer” and fracg.g4 < — 2% then goto Large Operand 

If tgt precision = “64-bit integer" and fraco.g4 < — 29? then goto Large Operand 

FPSCRxx € FPSCRxx | FPSCRE; 

If tgt precision = “32-bit integer” then FRT € Oxuuuu uuuu | fracgg.g4 /* u is undefined hex digit */ 
If tgt precision = “64-bit integer” then FRT + fracy.g4 


FPSCRrppAF € undefined 
Done 
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Round Integer(sign, fraco.g4,gbit,rbit, xbit,round_mode): 


inc ¢ 0 
If round mode = 0b00 then /* Round to Nearest */ 
Do /* comparisons ignore u bits */ 


If sign | fracg, | gbit | rbit | xbit = Obulluu then inc € 1 
If sign | fracg, | gbit | rbit | xbit = Obu01tu then inc + 1 
If sign | fracg, | gbit | rbit | xbit = Obu01u1 then inc € 1 


End 
If round mode = 0b10 then /* Round toward +Infinity */ 
Do /* comparisons ignore u bits */ 


If sign | fracg, | gbit | rbit | xbit = ObOutuu then inc € 1 
If sign | fracg, | gbit | rbit | xbit = Ob0uutu then inc € 1 
If sign | fracg, | gbit | rbit | xbit = Ob0uuu1 then inc € 1 


End 
If round_mode = 0b11 then /* Round toward - Infinity */ 
Do /* comparisons ignore u bits */ 


If sign | fracg, | gbit | rbit | xbit = Obtutuu then inc € 1 
If sign | fracg, | gbit | rbit | xbit = Obtuutu then inc € 1 
If sign | fracg, | gbit | rbit | xbit = Obtuuut then inc € 1 
End 

fracos € fracgg, + inc 

FPSCRgg € inc 

FPSCRE; € gbit | rbit | xbit 

Return 


Infinity Operand: 
FPSCRer ri vxcvi € 06001 


If FPSCRye = 0 then Do 
If tgt precision = “32-bit integer” then 


Do 
If sign = 0 then FRT + Oxuuuu uuuu 7FFF FFFF /* u is undefined hex digit */ 
If sign = 1 then FRT € Oxuuuu uuuu 8000 0000 /* u is undefined hex digit */ 
End 
Else 
Do 


If sign = 0 then FRT € Ox7FFF FFFF FFFF FFFF 
If sign = 1 then FRT € 0x8000 0000 0000 0000 
End 
FPSCRpppge € undefined 
End 
Done 


SNaN Operand: 


FPSCRer ri vxsnan vxcvi € 00011 
If FPSCRyg = 0 then 
Do 
If tgt precision = “32-bit integer" then FRT € Oxuuuu uuuu 8000 0000 /*u is undefined hex digit */ 
If tgt precision = “64-bit integer" then FRT € 0x8000 0000 0000 0000 
FPSCRrprç € undefined 
End 
Done 
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QNaN Operand: 


FPSCRer Fi vxcvi € 0b001 
If FPSCRyg = 0 then 
Do 
If tgt precision = “32-bit integer" then FRT € Oxuuuu uuuu 8000 0000 /*u is undefined hex digit */ 
If tgt precision = “64-bit integer" then FRT + 0x8000 0000 0000 0000 
FPSCRrprç € undefined 
End 
Done 


Large Operand: 
FPSCREr Fi vxcvi € 05001 
If FPSCRyg = 0 then Do 
If tgt precision = “32-bit integer" then 
Do 


If sign = 0 then FRT + Oxuuuu uuuu 7FFF FFFF /*uis undefined hex digit */ 
If sign = 1 then FRT € Oxuuuu uuuu 8000 0000 /*uis undefined hex digit */ 


If sign = 0 then FRT € Ox7FFF FFFF FFFF FFFF 
If sign = 1 then FRT + 0x8000 0000 0000 0000 


FPSCRrppAF € undefined 
End 


Done 
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A.3 Floating-Point Convert from Integer Model 


The following describes algorithmically the operation of the Floating Convert From Integer Doubleword instruction. 


sign € (FRB)o 
exp € 63 
fraco.g3 € (FRB) 


If fracg.g3 = O then go to Zero Operand 


If sign = 1 then fracg.g3 € = fracg.g3 + 1 


Do while fracg = 0 /* do the loop 0 times if (FRB) = maximum negative integer */ 


fraCo:63 ¢ fracy.¢3 | 0b0 
exp € exp - 1 
End 


Round Float(sign,exp,fracg.g3,FPSCRan) 


If sign = O then FPSCReppF € “+normal number” 
If sign = 1 then FPSCRepge € “- normal number” 


FRTo € sign 


FRT,44 € exp + 1023 /* exp + bias */ 


FRT 263 € fracy.so 
Done 


Zero Operand: 


FPSCRep pı € 0b00 

FPSCRrpnr € “+zero” 

FRT € 0x0000_0000_0000_0000 
Done 
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Round Float(sign, exp, fraco.g3,round_mode): 


inc € 0 

lsb € fracs> 

gbit € fracss 

rbit € fracs, 

Xbit € fracsseg > 0 

If round mode = 0b00 then /* Round to Nearest */ 

Do /* comparisons ignore u bits */ 

If sign | Isb | gbit | rbit | xbit = Obut1uu then inc € 1 
If sign | Isb | gbit | rbit | xbit = Obu011u then inc € 1 
If sign |Isb | gbit | rbit | xbit = Obu01ut then inc € 1 


End 
If round mode = 0b10 then /* Round toward +Infinity */ 
Do /* comparisons ignore u bits */ 


If sign | Isb | gbit | rbit | xbit = ObOutuu then inc € 1 
If sign | Isb | gbit | rbit | xbit = ObOuutu then inc € 1 
If sign | Isb | gbit | rbit | xbit = Ob0uuut then inc € 1 


End 
If round mode = 0b11 then /* Round toward - Infinity */ 
Do /* comparisons ignore u bits */ 


If sign | Isb | gbit | rbit | xbit = Obtutuu then inc € 1 
If sign |Isb | gbit | rbit | xbit = Obtuutu then inc € 1 
If sign |Isb | gbit | rbit | xbit = Obtuuut then inc € 1 
End 

fraco.so € fracg.5o + inc 

If carry_out = 1 then exp € exp+ 1 

FPSCRgg € inc 

FPSCRe, € gbit | rbit | xbit 

FPSCRxx € FPSCRyx | FPSCRe 

Return 
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Appendix B. Assembler Extended Mnemonics 


In order to make assembler language programs simpler to write and easier to understand, a set of extended 
mnemonics and symbols is provided that defines simple shorthand for the most frequently used forms of Branch 
Conditional, Compare, Trap, Rotate and Shift, and certain other instructions. 


Assemblers should provide the extended mnemonics and symbols listed here, and may provide others. 


B.1 Symbols 


The following symbols are defined for use in instructions (basic or extended mnemonics) that specify a Condition 
Register field or a Condition Register bit. The first five (It, ..., un) identify a bit number within a CR field. The 
remainder (cr0, ..., cr7) identify a CR field. An expression in which a CR field symbol is multiplied by 4 and then 
added to a bit-number-within-CR-field symbol can be used to identify a CR bit. 


Symbol Value Meaning 


It 0 Less than 

gt 1 Greater than 

eq 2 Equal 

so 3 Summary overflow 
un 3 Unordered (after floating-point comparison) 
cro 0 CR Field 0 

cri 1 CR Field 1 

cr2 2 CR Field 2 

cr3 3 CR Field 3 

cr4 4 CR Field 4 

cr5 5 CR Field 5 

cr6 6 CR Field 6 

cr7 7 CRField 7 


The extended mnemonics in Sections B.2.2 and B.3 require identification of a CR bit: if one of the CR field symbols 
is used, it must be multiplied by 4 and added to a bit-number-within-CR-field (value in the range 0-3, explicit or 
symbolic). The extended mnemonics in Sections B.2.3 and B.5 require identification of a CR field: if one of the CR 
field symbols is used, it must not be multiplied by 4. (For the extended mnemonics in Section B.2.3, the bit 
number within the CR field is part of the extended mnemonic. The programmer identifies the CR field, and the 
Assembler does the multiplication and addition required to produce a CR bit number for the BI field of the under- 
lying basic mnemonic.) 
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B.2 Branch Mnemonics 


The mnemonics discussed in this section are variations of the Branch Conditional instructions. 


Note: bclr, bclrl, bcctr, and bcctrl each serve as both a basic and an extended mnemonic. The Assembler will 
recognize a belr, belrl, bcctr, or bcctrl mnemonic with three operands as the basic form, and a belr, bclrl, bcctr, or 
bcctrl mnemonic with two operands as the extended form. In the extended form the BH operand is omitted and 
assumed to be 0b00. Similarly, for all the extended mnemonics described in Sections B.2.2 - B.2.4 that devolve to 
any of these four basic mnemonics the BH operand can either be coded or omitted. If it is omitted it is assumed 
to be 0b00. 


B.2.1 BO and BI Fields 


The 5-bit BO and BI fields control whether the branch is taken. Providing an extended mnemonic for every pos- 
sible combination of these fields would be neither useful nor practical. The mnemonics described in Sections 
B.2.2 - B.2.4 include the most useful cases. Other cases can be coded using a basic Branch Conditional mne- 
monic (bc[I] [a], bcIr[ I], bectr[I]) with the appropriate operands. 


B.2.2 Simple Branch Mnemonics 


Instructions using one of the mnemonics in Table 3 that tests a Condition Register bit specify the corresponding 
bit as the first operand. The symbols defined in Section B.1 can be used in this operand. 


Notice that there are no extended mnemonics for relative and absolute unconditional branches. For these the 
basic mnemonics b, ba, bl, and bla should be used. 


Table 3. Simple branch mnemonics 
LR not Set LR Set 
Branch Semantics be bea belr bectr bel bela belrl bectrl 
Relative | Absolute | To LR | To CTR | Relative | Absolute | To LR | To CTR 
Branch unconditionally — — blr betr — — birl betrl 
Branch if CRg;= 1 bt bta btlr btctr bil bila btlrl btctrl 
Branch if CRg¡=0 bf f bflrl bfctrl 
Decrement CTR, 
branch it CTR nonzero bdnz bdnza bdnzlr — bdnzl bdnzla | bdnzirl — 
Decrement CTR, 
branch if CTR nonzero bdnzt bdnzta | bdnztlr — bdnztl bdnztla | bdnztlrl — 
and CRp)= 1 
Decrement CTR, 
branch if CTR nonzero bdnzf bdnzfa bdnzflr — bdnzfl bdnzfla | bdnzflrl — 
and CRgı= 0 
Decrement CTR, bdlri E 
branch if CTR zero 
Decrement CTR, 
branch if CTR zero bdzt bdzta bdztlr — bdztl bdztla bdztlr| — 
and CRp)= 1 
Decrement CTR, 
branch if CTR zero bdzf bdzfa bdzflr — bdzfl bdzfla bdzflrl — 
and CRgi- 0 
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Examples 


1. Decrement CTR and branch if it is still nonzero (closure of a loop controlled by a count loaded into CTR). 


bdnz target (equivalent to: bc 16,0,target) 
2. Same as (1) but branch only if CTR is nonzero and condition in CRO is “equal”. 

bdnzt eq,target (equivalent to: bc 8,2,target) 
3. Same as (2), but “equal” condition is in CR5. 

bdnzt 4*cr5+eq,target (equivalent to: bc 8,22,target) 
4. Branch if bit 27 of CR is 0. 

bf 27 target (equivalent to: bc 4,27,target) 


5. Same as (4), but set the Link Register. This is a form of conditional “call”. 


bfl 27,target (equivalent to: bel 4,27,target) 


B.2.3 Branch Mnemonics Incorporating Conditions 


In the mnemonics defined in Table 4 on page 146, the test of a bit in a Condition Register field is encoded in the 


mnemonic. 


Instructions using the mnemonics in Table 4 specify the Condition Register field as an optional first operand. 


One 


of the CR field symbols defined in Section B.1 can be used for this operand. If the CR field being tested is CR 
Field O, this operand need not be specified unless the resulting basic mnemonic is bclr[I] or bectr[ If] and the BH 


operand is specified. 
A standard set of codes has been adopted for the most common combinations of branch conditions. 


Code Meaning 


It Less than 
le Less than or equal 
eq Equal 


ge Greater than or equal 

gt Greater than 

nl Not less than 

ne Not equal 

ng Not greater than 

so Summary overflow 

ns Not summary overflow 

un Unordered (after floating-point comparison) 

nu Not unordered (after floating-point comparison) 


These codes are reflected in the mnemonics shown in Table 4. 
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Table 4. Branch mnemonics incorporating conditions 


LR not Set LR Set 

Branch Semantics be belr bectr belrl | bectrl 
Relative | Absolute | To LR | To CTR | Relative | Absolute | To LR | To CTR 

Branch if less than bit bita bltlr bltctr bitl bitla bltIrl bltctrl 
Branch if less than or equal ble blea blelr blectr blel blela blelrl blectrl 
Branch if equal beq beqa beglr beqctr begl begla beglrl | beqctrl 
Branch if greater than or equal bge bgelr bgelrl | bgectrl 
Branch if greater than bgt bgta bgtlr bgtctr bgtl bgtla batir! bgtctrl 
Branch if not less than bnl bnla bnilr bnictr bnil bnila bnilrl bnlctrl 
Branch if not equal bne bnea bnelr bnectr bnel bnela bnelrl | bnectrl 
Branch if summary overflow bso bsoa bsolr bsoctr bsol bsola bsolrl | bsoctrl 
Branch if not summary overflow bns bnsa bnslr bnsctr bnsl bnsla bnslrl | bnsctrl 
Branch if unordered bun buna bunlr bunctr bunl bunla bunlrl | bunctrl 


Branch if not unordered bnu bnulr bnuctr bnulrl | bnuctrl 


Examples 


1. Branch if CRO reflects condition “not equal”. 


bne target (equivalent to: bc 4,2,target) 
2. Same as (1), but condition is in CR3. 
bne  cr3,target (equivalent to: bc 4,14,target) 


3. Branch to an absolute target if CR4 specifies “greater than”, setting the Link Register. This is a form of 
conditional “call”. 


bgtla cr4,target (equivalent to: bcla 12,17,target) 
4. Same as (3), but target address is in the Count Register. 


bgtctrl cr4 (equivalent to: bectrl 12,17,0) 


B.2.4 Branch Prediction 


Software can use the “at” bits of Branch Conditional instructions to provide a hint to the processor about the 
behavior of the branch. If, for a given such instruction, the branch is almost always taken or almost always not 
taken, a suffix can be added to the mnemonic indicating the value to be used for the “at” bits. 


+ Predict branch to be taken (at=0b11) 
— Predict branch not to be taken (at=0b10) 


Such a suffix can be added to any Branch Conditional mnemonic, either basic or extended, that tests either the 
Count Register or a CR bit (but not both). Assemblers should use 0b00 as the default value for the “at” bits, 
indicating that software has offered no prediction. 


Examples 


1. Branch if CRO reflects condition “less than”, specifying that the branch should be predicted to be taken. 
blt+ target 

2. Same as (1), but target address is in the Link Register and the branch should be predicted not to be taken. 
bltlr— 
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B.3 Condition Register Logical Mnemonics 


The Condition Register Logical instructions can be used to set (to 1), clear (to 0), copy, or invert a given Condition 
Register bit. Extended mnemonics are provided that allow these operations to be coded easily. 


Table 5. Condition Register logical mnemonics 


Operation Extended Mnemonic Equivalent to 


Condition Register set crset bx creqv bx,bx,bx 
Condition Register clear crclr bx crxor bx,bx,bx 
Condition Register move crmove bx,by cror bx,by,by 


Condition Register not crnot bx,by crnor bx,by,by 
The symbols defined in Section B.1 can be used to identify the Condition Register bits. 


Examples 


1. Set CR bit 25. 


crset 25 (equivalent to: creqv 25,25,25) 
2. Clear the SO bit of CRO. 

crelr so (equivalent to: crxor  3,3,3) 
3. Same as (2), but SO bit to be cleared is in CR3. 

crclr  4*cr3+s0 (equivalent to: crxor 15,15,15) 
4. Invert the EQ bit. 

crnot  eg,eq (equivalent to: crnor 2,2,2) 


5. Same as (4), but EQ bit to be inverted is in CR4, and the result is to be placed into the EQ bit of CR5. 


crnot 4*cr5+eq,4*cr4+eq (equivalent to: crnor 22,18,18) 


B.4 Subtract Mnemonics 


B.4.1 Subtract Immediate 


Although there is no “Subtract Immediate” instruction, its effect can be achieved by using an Add Immediate 
instruction with the immediate operand negated. Extended mnemonics are provided that include this negation, 
making the intent of the computation clearer. 


subi Rx,Ry,value (equivalent to: addi Rx,Ry,— value) 
subis Rx,Ry,value (equivalent to: addis Rx,Ry,- value) 
subic Rx,Ry,value (equivalent to: addic Rx,Ry,- value) 
subic. Rx,Ry,value (equivalent to: addic. Rx,Ry,— value) 
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B.4.2 Subtract 


The Subtract From instructions subtract the second operand (RA) from the third (RB). Extended mnemonics are 
provided that use the more “normal” order, in which the third operand is subtracted from the second. Both these 
mnemonics can be coded with a final “o” and/or “.” to cause the OE and/or Rc bit to be set in the underlying 
instruction. 


sub Rx,Ry,Rz (equivalent to: subf  Rx,Rz, Ry) 
subc Rx,Ry,Rz (equivalent to: subfc Rx,Rz,Ry) 


B.5 Compare Mnemonics 


The L field in the fixed-point Compare instructions controls whether the operands are treated as 64-bit quantities 
or as 32-bit quantities. Extended mnemonics are provided that represent the L value in the mnemonic rather than 
requiring it to be coded as a numeric operand. 


The BF field can be omitted if the result of the comparison is to be placed into CR Field 0. Otherwise the target 
CR field must be specified as the first operand. One of the CR field symbols defined in Section B.1 can be used 
for this operand. 


Note: The basic Compare mnemonics of PowerPC AS are the same as those of POWER, but the POWER 
instructions have three operands while the PowerPC AS instructions have four. The Assembler will recognize a 
basic Compare mnemonic with three operands as the POWER form, and will generate the instruction with L=0. 
(Thus the Assembler must require that the BF field, which normally can be omitted when CR Field 0 is the target, 
be specified explicitly if L is.) 
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B.5.1 Doubleword Comparisons 


Table 6. Doubleword compare mnemonics 
Operation 


Extended Mnemonic 


Equivalent to 


Compare doubleword immediate 


cmpdi bf,ra,si 


cmpi bf,1,ra,si 


Compare doubleword 


Compare logical doubleword immediate cmpldi bf,ra,ui 


Compare logical doubleword 


cmpd bf,ra,rb 


cmpld bf,ra,rb 


cmp bf,1,ra,rb 
cmpli bf,1,ra,ui 


cmpl bf,1,ra,rb 


Examples 


1. Compare register Rx and immediate value 100 as unsigned 64-bit integers and place result into CRO. 


cmpldi Rx,100 
2. Same as (1), but place result into CR4. 
cmpldi cr4,Rx,100 


(equivalent to: cmpli — 0,1,Rx,100) 


(equivalent to: cmpli 4,1,Rx,100) 


3. Compare registers Rx and Ry as signed 64-bit integers and place result into CRO. 


cmpd Rx,Ry 


B.5.2 Word Comparisons 


(equivalent to: cmp 


0,1,Rx,Ry) 


Table 7. Word compare mnemonics 


Operation 


Extended Mnemonic 


Compare word immediate cmpwi bf,ra,si 


Compare word 


cmpw bf,ra,rb 


Equivalent to 
cmpi bf,O,ra,si 


cmp bf,0,ra,rb 


Compare logical word immediate 


cmplwi bf,ra,ui 


cmpli bf,0,ra,ui 


Compare logical word 


cmplw bf,ra,rb 


cmpl bf,0,ra,rb 


Examples 


1. Compare bits 32:63 of register Rx and immediate value 100 as signed 32-bit integers and place result into 


CRO. 
cmpwi Rx,100 
2. Same as (1), but place result into CR4. 
cmpwi cr4,Rx,100 


3. Compare bits 32:63 of registers Rx and Ry as unsigned 32-bit integers and place result into CRO. 


cmplw Rx,Ry 
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(equivalent to: cmpi 


(equivalent to: cmpl 


0,0,Rx,100) 


4,0,Rx,100) 


0,0,Rx,Ry) 
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B.6 Trap Mnemonics 


The mnemonics defined in Table 8 are variations of the Trap instructions, with the most useful values of TO 
represented in the mnemonic rather than specified as a numeric operand. 


A standard set of codes has been adopted for the most common combinations of trap conditions. 


Code Meaning TO encoding <>=<5 
It Less than 16 10000 
le Less than or equal 20 10100 
eq Equal 4 00100 
ge Greater than or equal 12 01100 
gt Greater than 8 01000 
nl Not less than 12 01100 
ne Not equal 24 11000 
ng Not greater than 20 10100 
lit Logically less than 2 00010 
lle Logically less than or equal 6 00110 
Ige Logically greater than or equal 5 00101 
Igt Logically greater than 1 00001 
Ini Logically not less than 5 00101 
Ing Logically not greater than 6 00110 
u Unconditionally with parameters 31 dex PST E a 
(none) Unconditional 31 1111 1 


These codes are reflected in the mnemonics shown in Table 8. 


Table 8. Trap mnemonics 


64-bit Comparison 


tdi td 
Immediate Register 


32-bit Comparison 


Trap Semantics tw 


Register 


twi 
Immediate 


Trap unconditionally 


Trap unconditionally with parameters 


Trap if less than 


Trap if equal 


Trap if greater than or equal twge 
Trap if greater than tdgti tdgt twgti twgt 

twnl 
Trap if not equal tdnei tdne twnei twne 
Trap if not greater than tdngi tdng twngi twng 
Trap if logically less than tdllti tdllt twllti twllt 
Trap if logically greater than or equal tdigei tdlge twlgei twlge 
Trap if logically greater than tdlgti tdlgt twlgti twlgt 
Trap if logically not less than tdlnli tdini twinli twin! 


Examples 


1. Trap if register Rx is not 0. 
tdnei Rx,0 (equivalent to: tdi 24,Rx,0) 
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2. Same as (1), but comparison is to register Ry. 
tdne Rx,Ry (equivalent to: td 24,Rx,Ry) 
3. Trap if bits 32:63 of register Rx, considered as a 32-bit quantity, are logically greater than Ox7FF. 
twlgti Rx,0x7FF (equivalent to: twi 1,Rx,0x7FF) 
4. Trap unconditionally. 
trap (equivalent to: tw 31,0,0) 
5. Trap unconditionally with immediate parameters Rx and Ry 
tdu Rx, Ry (equivalent to: td 31,Rx,Ry) 


B.7 Rotate and Shift Mnemonics 


The Rotate and Shift instructions provide powerful and general ways to manipulate register contents, but can be 
difficult to understand. Extended mnemonics are provided that allow some of the simpler operations to be coded 
easily. 


Mnemonics are provided for the following types of operation. 


Extract Select a field of n bits starting at bit position b in the source register; left or right justify this field in 
the target register; clear all other bits of the target register to 0. 


Insert Select a left-justified or right-justified field of n bits in the source register; insert this field starting at 
bit position b of the target register; leave other bits of the target register unchanged. (No extended 
mnemonic is provided for insertion of a left-justified field when operating on doublewords, because 
such an insertion requires more than one instruction.) 


Rotate Rotate the contents of a register right or left n bits without masking. 
Shift Shift the contents of a register right or left n bits, clearing vacated bits to O (logical shift). 
Clear Clear the leftmost or rightmost n bits of a register to 0. 


Clear left and shift left 
Clear the leftmost b bits of a register, then shift the register left by n bits. This operation can be used 
to scale a (known nonnegative) array index by the width of an element. 


B.7.1 Operations on Doublewords 


All these mnemonics can be coded with a final “.” to cause the Rc bit to be set in the underlying instruction. 


Table 9. Doubleword rotate and shift mnemonics 


Operation Extended Mnemonic Equivalent to 

Extract and left justify immediate extldi ra,rs,n,b (n> 0) ridicr ra,rs,b,n- 1 
Extract and right justify immediate extrdi ra,rs,n,o (n> 0) ridicl ra,rs,b+n,64- n 
Insert from right immediate insrdi ra,rs,n,o (n> 0) ridimi ra,rs,64— (b+n),b 
Rotate left immediate rotldi ra,rs,n ridicl ra,rs,n,O 

Rotate right immediate rotrdi ra,rs,n ridicl ra,rs,64— n,0 
Rotate left rotld ra,rs,rb rldcl ra,rs,rb,0 

Shift left immediate sldi ra,rs,n (n < 64) ridicr ra,rs,n,63- n 
Shift right immediate srdi ra,rs,n (n < 64) ridicl ra,rs,64— n,n 
Clear left immediate clridi rars,n (n < 64) ridicl ra,rs,0,n 

Clear right immediate clrrdi ra,rs,n (n < 64) ridicr ra,rs,0,63- n 
Clear left and shift left immediate clrisldi ra,rsb,n (n x b< 64) ridic ra,rs,n,b- n 
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Examples 


1. Extract the sign bit (bit 0) of register Ry and place the result right-justified into register Rx. 


extrdi Rx,Ry,1,0 (equivalent to: ridicl Rx,Ry,1,63) 
2. Insert the bit extracted in (1) into the sign bit (bit 0) of register Rz. 

insrdi Rz,Rx,1,0 (equivalent to: ridimi Rz,Rx,63,0) 
3. Shift the contents of register Rx left 8 bits. 

sldi Rx,Rx,8 (equivalent to: ridicr Rx,Rx,8,55) 


4. Clear the high-order 32 bits of register Ry and place the result into register Rx. 
ciridi Rx,Ry,32 (equivalent to: ridicl Rx,Ry,0,32) 


B.7.2 Operations on Words 


All these mnemonics can be coded with a final “.” to cause the Rc bit to be set in the underlying instruction. The 
operations as described above apply to the low-order 32 bits of the registers, as if the registers were 32-bit regis- 
ters. The Insert operations either preserve the high-order 32 bits of the target register or place rotated data 
there; the other operations clear these bits. 


Table 10. Word rotate and shift mnemonics 
Operation Extended Mnemonic Equivalent to 
Extract and left justify immediate extlwi ra,rs,n,o (n> 0) rlwinm ra,rs,b,0,n— 1 
Extract and right justify immediate extrwi ra,rs,n,b (n> 0) rlwinm ra,rs,b«n,32- n,31 
Insert from left immediate inslwi ra,rs,n,o (n> 0) rlwimi ra,rs,32— b,b,(b+n)- 1 
Insert from right immediate insrwi ra,rs,n,o (n> 0) rlwimi ra,rs,32— (b+n),b,(b+n)- 1 
Rotate left immediate rotlwi ra,rs,n rlwinm ra,rs,n,0,31 
Rotate right immediate rotrwi ra,rs,n rlwinm ra,rs,32- n,0,31 
Rotate left rotlw ra,rs,rb riwnm ra,rs,rb,0,31 
Shift left immediate slwi ra,rs,n (n < 32) rlwinm ra,rs,n,0,31- n 
Shift right immediate srwi rars,n (n < 32) rlwinm ra,rs,32- n,n,31 
Clear left immediate clrlwi ra,rs,n (n < 32) rlwinm ra,rs,0,n,31 
Clear right immediate clrrwi rars,n (n < 32) rlwinm ra,rs,0,0,31- n 
Clear left and shift left immediate clrislwi ra,rs,b,n (n € b « 32) rlwinm ra,rs,n,b- n,31- n 
Examples 


1. Extract the sign bit (bit 32) of register Ry and place the result right-justified into register Rx. 

extrwi | RRy,1,0 (equivalent to: rlwinm Rx,Ry,1,31,31) 
2. Insert the bit extracted in (1) into the sign bit (bit 32) of register Rz. 

insrwi | RZ Rx,1,0 (equivalent to: riwimi Rz,Rx,31,0,0) 
3. Shift the contents of register Rx left 8 bits, clearing the high-order 32 bits. 

slwi Rx,Rx,8 (equivalent to: rlwinm Rx,Rx,8,0,23) 


4. Clear the high-order 16 bits of the low-order 32 bits of register Ry and place the result into register Rx, 
clearing the high-order 32 bits of register Rx. 


clrlwi Rx,Ry,16 (equivalent to: rlwinm Rx,Ry,0,16,31) 
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B.8 Move To/From Special Purpose Register Mnemonics 


The mtspr and mfspr instructions specify a Special Purpose Register (SPR) as a numeric operand. Extended mne- 
monics are provided that represent the SPR in the mnemonic rather than requiring it to be coded as an operand. 


Table 11. Extended mnemonics for moving to/from an SPR 


: Move To SPR Move From SPR 
Special Purpose Register 


Extended Equivalent to Extended Equivalent to 


Fixed-Point Exception Register (XER) mtxer Rx mtspr 1,Rx mfxer Rx mfspr Rx,1 


Link Register (LR) mtlr Rx mtspr 8,Rx mflr Rx mfspr Rx,8 


Count Register (CTR) mtctr Rx mtspr 9,Rx mfctr Rx mfspr Rx,9 


Examples 


1. Copy the contents of register Rx to the XER. 

mixer Rx (equivalent to: mtspr 1,Rx) 
2. Copy the contents of the LR to register Rx. 

mflr Rx (equivalent to: mfspr Rx,8) 
3. Copy the contents of register Rx to the CTR. 


mtctr Rx (equivalent to: mtspr 9,Rx) 


B.9 Miscellaneous Mnemonics 


No-op 

Many PowerPC AS instructions can be coded in a way such that, effectively, no operation is performed. An 
extended mnemonic is provided for the preferred form of no-op. If an implementation performs any type of run- 
time optimization related to no-ops, the preferred form is the no-op that will trigger this. 


nop (equivalent to: ori 0,0,0) 
Load Immediate 
The addi and addis instructions can be used to load an immediate value into a register. Extended mnemonics are 
provided to convey the idea that no addition is being performed but merely data movement (from the immediate 
field of the instruction to a register). 
Load a 16-bit signed immediate value into register Rx. 

li Rx,value (equivalent to: addi Rx,0,value) 


Load a 16-bit signed immediate value, shifted left by 16 bits, into register Rx. 


lis Rx,value (equivalent to: addis Rx,0,value) 
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Load Address 


This mnemonic permits computing the value of a base-displacement operand, using the addi instruction which 
normally requires separate register and immediate operands. 


la Rx,D(Ry) (equivalent to: addi Rx,Ry,D) 
The la mnemonic is useful for obtaining the address of a variable specified by name, allowing the Assembler to 
supply the base register number and compute the displacement. If the variable v is located at offset Dv bytes 
from the address in register Rv, and the Assembler has been told to use register Rv as a base for references to 
the data structure containing v, then the following line causes the address of v to be loaded into register Rx. 


la Rx,v (equivalent to: addi | Rx,Rv,Dv) 


Move Register 

Several PowerPC AS instructions can be coded in a way such that they simply copy the contents of one register to 
another. An extended mnemonic is provided to convey the idea that no computation is being performed but 
merely data movement (from one register to another). 


The following instruction copies the contents of register Ry to register Rx. This mnemonic can be coded with a 
final “.” to cause the Rc bit to be set in the underlying instruction. 


mr Rx,Ry (equivalent to: or Rx, Ry,Ry) 


Complement Register 

Several PowerPC AS instructions can be coded in a way such that they complement the contents of one register 
and place the result into another register. An extended mnemonic is provided that allows this operation to be 
coded easily. 


The following instruction complements the contents of register Ry and places the result into register Rx. This 
mnemonic can be coded with a final “.” to cause the Rc bit to be set in the underlying instruction. 


not Rx,Ry (equivalent to: nor  Rx,Ry,Ry) 


Move To Condition Register 


This mnemonic permits copying the contents of the low-order 32 bits of a GPR to the Condition Register, using the 
same style as the mfcr instruction. 


mtcr Rx (equivalent to: mtcrf OxFF,Rx) 
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Appendix C. Programming Examples 


C.1 Multiple-Precision Shifts 


This section gives examples of how multiple-precision 
shifts can be programmed. 


A multiple-precision shift is defined to be a shift of an 
N-doubleword quantity (64-bit mode) or an N-word 
quantity (32-bit mode), where N> 1. The quantity to be 
shifted is contained in N registers. The shift amount 
is specified either by an immediate value in the 
instruction or by a value in a register. 


The examples shown below distinguish between the 
cases N=2 and N>2. If N=2, the shift amount may be 
in the range O through 127 (64-bit mode) or O through 
63 (32-bit mode), which are the maximum ranges sup- 
ported by the Shift instructions used. However if 
N>2, the shift amount must be in the range 0 through 
63 (64-bit mode) or O through 31 (32-bit mode), in 
order for the examples to yield the desired result. 
The specific instance shown for N>2 is N=3; 
extending those code sequences to larger N is 
straightforward, as is reducing them to the case N=2 


when the more stringent restriction on shift amount is 
met. For shifts with immediate shift amounts only the 
case N=3 is shown, because the more stringent 
restriction on shift amount is always met. 


In the examples it is assumed that GPRs 2 and 3 (and 
4) contain the quantity to be shifted, and that the 
result is to be placed into the same registers, except 
for the immediate left shifts in 64-bit mode for which 
the result is placed into GPRs 3, 4, and 5. In all 
cases, for both input and result, the lowest-numbered 
register contains the highest-order part of the data 
and highest-numbered register contains the lowest- 
order part. For non-immediate shifts, the shift 
amount is assumed to be in GPR 6. For immediate 
shifts, the shift amount is assumed to be greater than 
O. GPRs 0 and 31 are used as scratch registers. 


For N> 2, the number of instructions required is 2N- 1 
(immediate shifts) or 3N- 1 (non-immediate shifts). 
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Multiple-precision shifts in 64-bit mode Multiple-precision shifts in 32-bit mode 


Shift Left Immediate, N = 3 (shift amnt < 64) Shift Left Immediate, N = 3 (shift amnt < 32) 


ridicr r5,r4,sh,63-sh rlwinm r2,r2,sh,0,31-sh 
ridimi r4,r3,0,sh rlwimi r2,r3,sh,32-sh,31 
ridicl r4,r4,sh,0 rlwinm r3,r3,sh,0,31-sh 
ridimi r3,r2,0,sh rlwimi r3,r4,sh,32-sh,31 
ridicl r3,r3,sh,0 rlwinm r4,r4,sh,0,31-sh 


Shift Left, N = 2 (shift amnt < 128) 


Shift Left, N = 2 (shift amnt < 64) 


subfic r31,r6,64 subfic r31,r6,32 
sid r2,r2,r6 slw r2,r2,r6 
srd r0,r3,r31 srw r0,r3,r31 
or r2,r2,r0 or r2,r2,r0 
addi r31,r6,-64 addi r31,r6,-32 
sid r0,r3,r31 slw r0,r3,r31 
or r2,r2,r0 or r2,r2,r0 
sid r3,r3,r6 slw r3,r3,r6 
Shift Left, N = 3 (shift amnt < 64) Shift Left, N = 3 (shift amnt < 32) 
subfic r31,r6,64 subfic r31,r6,32 
sid r2,r2,r6 slw r2,r2,r6 
srd r0,r3,r31 srw r0,r3,r31 
or r2,r2,r0 or r2,r2,r0 
sid r3,r3,r6 siw r3,r3,r6 
srd r0,r4,r31 srw r0,r4,r31 
or r3,r3,r0 or r3,r3,r0 
sid r4,r4,r6 slw r4,r4,r6 


Shift Right Immediate, N = 3 (shift amnt < 64) 


Shift Right Immediate, N = 3 (shift amnt < 32) 


ridimi r4,r3,0,64-sh rlwinm r4,r4,32-sh,sh,31 
ridicl r4,r4,64-sh,0 rlwimi r4,r3,32-sh,0,sh-1 
ridimi r3,r2,0,64-sh rlwinm r3,r3,32-sh,sh,31 
ridicl r3,r3,64-sh,0 rlwimi r3,r2,32-sh,0,sh-1 
ridicl r2,r2,64-sh,sh rlwinm r2,r2,32-sh,sh,31 


Shift Right, N = 2 (shift amnt< 128) 


Shift Right, N = 2 (shift amnt < 64) 


subfic r31,r6,64 subfic r31,r6,32 
srd r3,r3,r6 srw r3,r3,r6 
sid r0,r2,r31 slw r0,r2,r31 
or r3,r3,r0 or r3,r3,r0 
addi r31,r6,-64 addi r31,r6,-32 
srd r0,r2,r31 srw r0,r2,r31 
or r3,r3,r0 or r3,r3,r0 
srd r2,r2,r6 srw r2,r2,r6 


Shift Right, N = 3 (shift amnt < 64) 


Shift Right, N = 3 (shift amnt « 32) 


subfic r31,r6,64 subfic r31,r6,32 
srd r4,r4,r6 srw r4,r4,r6 
sid r0,r3,r31 slw r0,r3,r31 
or r4,r4,r0 or r4,r4,r0 
srd r3,r3,r6 srw r3,r3,r6 
sid r0,r2,r31 slw r0,r2,r31 
or r3,r3,r0 or r3,r3,r0 
srd r2,r2,r6 srw r2,r2,r6 
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Multiple-precision shifts in 64-bit mode, 
continued 


Multiple-precision shifts in 32-bit mode, 
continued 


Shift Right Algebraic Immediate, N = 3 (shift amnt « 64) Shift Right Algebraic Immediate, N = 3 (shift amnt « 32) 


ridimi r4,r3,0,64-sh rlwinm r4,r4,32-sh,sh,31 
ridicl r4,r4,64-sh,0 rlwimi r4,r3,32-sh,0,sh-1 
ridimi r3,r2,0,64-sh rlwinm r3,r3,32-sh,sh,31 
ridicl r3,r3,64-sh,0 rlwimi r3,r2,32-sh,0,sh-1 
sradi r2,r2,sh srawi r2,r2,sh 


Shift Right Algebraic, N = 2 (shift amnt < 128) 


Shift Right Algebraic, N = 2 (shift amnt « 64) 


subfic r31,r6,64 subfic r31,r6,32 
srd r3,r3,r6 srw r3,r3,r6 
sid r0,r2,r31 slw r0,r2,r31 
or r3,r3,r0 or r3,r3,r0 
addic. r31,r6,-64 addic. r31,r6,-32 
srad r0,r2,r31 sraw r0,r2,r31 
ble $+8 ble $+8 

ori r3,r0,0 ori r3,r0,0 
srad r2,r2,r6 sraw r2,r2,r6 


Shift Right Algebraic, N = 3 (shift amnt « 64) Shift Right Algebraic, N = 3 (shift amnt < 32) 


subfic r31,r6,64 subfic r31,r6,32 
srd r4,r4,r6 srw r4,r4,r6 
sid r0,r3,r31 slw r0,r3,r31 
or r4,r4,r0 or r4,r4,r0 
srd r3,r3,r6 srw r3,r3,r6 
sid r0,r2,r31 slw r0,r2,r31 
or r3,r3,r0 or r3,r3,r0 
srad r2,r2,r6 sraw r2,r2,r6 
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C.2 Floating-Point Conversions 


This section gives examples of how the Floating-Point 
Conversion instructions can be used to perform 
various conversions. 


C.2.1 Conversion from 
Floating-Point Number to 
Floating-Point Integer 


The full convert to floating-point integer function can 
be implemented with the sequence shown below, 
assuming the floating-point value to be converted is 
in FPR 1 and the result is returned in FPR 3. 


mtfsb0 23 #clear VXCVI 
fctid[z] £3,f1 #convert to fx int 
fcfid [3,13 #convert back again 
mcrfs ad; #VXCVI to CR 

bf 31,$+8 #skip if VXCVI was 0 
fmr £3, 41 #input was fp int 


C.2.2 Conversion from Floating-Point 
Number to Signed Fixed-Point Integer 
Doubleword 


The full convert to signed fixed-point integer 
doubleword function can be implemented with the 
sequence shown below, assuming the floating-point 
value to be converted is in FPR 1, the result is 
returned in GPR 3, and a doubleword at displacement 
“disp” from the address in GPR 1 can be used as 
scratch space. 


fctid[z] £2,f1 


stfd f2,disp (r1) 
ld r3,disp(r1) 


convert to dword int 
#store float 
#load dword 
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Warning: Some of the examples use the optional fsel 
instruction. Care must be taken in using fsel if IEEE 
compatibility is required, or if the values being tested 
can be NaNs or infinities; see Section C.3.4, “Notes” 
on page 160. 


C.2.3 Conversion from Floating-Point 
Number to Unsigned Fixed-Point 
Integer Doubleword 


The full convert to unsigned fixed-point integer 
doubleword function can be implemented with the 
sequence shown below, assuming the floating-point 
value to be converted is in FPR 1, the value 0 is in 
FPR 0, the value 284- 2048 is in FPR 3, the value 283 is 
in FPR 4 and GPR 4, the result is returned in GPR 3, 
and a doubleword at displacement “disp” from the 
address in GPR 1 can be used as scratch space. 


fsel £2,f1,f1, £0 
fsub £5,f3,f1 
fsel £2,£5,£2, £3 
fsub £5, f2, £4 
fcmpu cr2, £2, £4 
fsel £2,£5, £5, £2 


use 0 if < 0 
use max if > max 


subtract 2**63 
use diff if > 2**63 


fctid[z] f2, f2 convert to fx int 
stfd f2,disp (r1) store float 

ld r3,disp(r1) load dword 

blt cr2,$+8 add 2**63 if input 
add r3,r3,r4 was 2 2**63 


C.2.4 Conversion from Floating-Point 
Number to Signed Fixed-Point Integer 
Word 


The full convert to signed fixed-point integer word 
function can be implemented with the sequence 
shown below, assuming the floating-point value to be 
converted is in FPR 1, the result is returned in GPR 3, 
and a doubleword at displacement “disp” from the 
address in GPR 1 can be used as scratch space. 


fctiw[z] £2,f1 #convert to fx int 
stfd f2,disp(rl)  #store float 
lwa r3,disp*4(rl) fload word algebraic 
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C.2.5 Conversion from Floating-Point 
Number to Unsigned Fixed-Point 
Integer Word 


The full convert to unsigned fixed-point integer word 
function can be implemented with the sequence 
shown below, assuming the floating-point value to be 
converted is in FPR 1, the value O is in FPR 0, the 
value 22-14 is in FPR 3, the result is returned in GPR 
3, and a doubleword at displacement “disp” from the 
address in GPR 1 can be used as scratch space. 


fsel £2, f1,f1,f0 fuse 0 if < 0 

fsub £4,£3,f1 fuse max if > max 
fsel £2: 14.23 

fctid[z] £2, £2 #convert to fx int 
stfd f2,disp(rl) #store float 

lwz r3,disp*4(rl) #load word and zero 


C.2.6 Conversion from Signed 
Fixed-Point Integer Doubleword to 
Floating-Point Number 


The full convert from signed fixed-point integer 
doubleword function, using the rounding mode speci- 
fied by FPSCRay, can be implemented with the 
sequence shown below, assuming the fixed-point 
value to be converted is in GPR 3, the result is 
returned in FPR 1, and a doubleword at displacement 
"disp" from the address in GPR 1 can be used as 
scratch space. 


std r3,disp(rl)  fstore dword 
lfd fl,disp(rl)  fload float 
fcfid f1,fl #convert to fp int 


C.2.7 Conversion from Unsigned 
Fixed-Point Integer Doubleword to 
Floating-Point Number 


The full convert from unsigned fixed-point integer 
doubleword function, using the rounding mode speci- 
fied by FPSCRan, can be implemented with the 
sequence shown below, assuming the fixed-point 
value to be converted is in GPR 3, the value 29? is in 
FPR 4, the result is returned in FPR 1, and two 
doublewords at displacement “disp” from the address 
in GPR 1 can be used as scratch space. 


rldicl  r2,r3,32,32 isolate high half 
rldicl  r0,r3,0,32 isolate low half 

std r2,disp(r1) store dword both 

std r0, disp+8 (r1) 

lfd £2, disp (r1) load float both 

lfd fl, disp+8 (r1) 

fcfid f2,£2 convert each half to 
fcfid f1,fl fp int (exact result) 
fmadd £1,f4, £2, f1 (2**32) *high * low 


An alternative, shorter, sequence can be used if 
rounding according to FSCPRgy is desired and 
FPSCRgy specifies Round toward + Infinity or Round 
toward — Infinity, or if it is acceptable for the rounded 
answer to be either of the two representable floating- 
point integers nearest to the given fixed-point integer. 
In this case the full convert from unsigned fixed-point 
integer doubleword function can be implemented with 
the sequence shown below, assuming the value 294 is 
in FPR 2. 


std r3,disp(rl)  fstore dword 
lfd fl, disp (r1) tload float 
fcfid f1,fl #convert to fp int 


fadd £4,f1,f2 tadd 2**64 
fsel f1,f1,f1,f4 $ if r3 <0 


C.2.8 Conversion from Signed 
Fixed-Point Integer Word to 
Floating-Point Number 


The full convert from signed fixed-point integer word 
function can be implemented with the sequence 
shown below, assuming the fixed-point value to be 
converted is in GPR 3, the result is returned in FPR 1, 
and a doubleword at displacement “disp” from the 
address in GPR 1 can be used as scratch space. (The 
result is exact.) 


ext sw ES, 13 #extend sign 


std r3,disp(rl)  fstore dword 
lfd fl,disp(rl) #load float 
fcfid f1,f1 #convert to fp int 


C.2.9 Conversion from Unsigned 
Fixed-Point Integer Word to 
Floating-Point Number 


The full convert from unsigned fixed-point integer 
word function can be implemented with the sequence 
shown below, assuming the fixed-point value to be 
converted is in GPR 3, the result is returned in FPR 1, 
and a doubleword at displacement “disp” from the 
address in GPR 1 can be used as scratch space. (The 
result is exact.) 


ridici. r0,r3,0,32 #zero-extend 
std r0, disp (r1) #store dword 
lfd fl,disp(rl)  fload float 
fcfid f1,fl #convert to fp int 
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C.3 Floating-Point Selection 


This section gives examples of how the optional 
Floating Select instruction can be used to implement 
floating-point minimum and maximum functions, and 
certain simple forms of if-then-else constructions, 
without branching. 


The examples show program fragments in an imagi- 
nary, C-like, high-level programming language, and 
the corresponding program fragment using fsel and 
other PowerPC AS instructions. In the examples, a, b, 


C.3.1 Comparison to Zero 
High-level language: PowerPC AS: Notes 
if a 2 0.0 then x € y fsel fx,fa,fy,fz (1) 
else x € z 
if a> 0.0 then x € y fneg fs,fa (1,23 
else x € z fsel fx,fs,fz,fy 
if a= 0.0 then x € y fsel fx,fa,fy,fz (1) 
else x ¢z fneg fs,fa 
fsel fx,fs,fx,fz 
C.3.2 Minimum and Maximum 
High-level language: PowerPC AS: Notes 
x € min (a,b) fsub fs,fa,fb (3,4,5) 
fsel fx,fs,fb,fa 
x € max (a,b) fsub fs,fa,fb (3,4,5) 
fsel fx,fs,fa,fb 
C.3.3 Simple if-then-else 
Constructions 
High-level language: PowerPC AS: Notes 
ifa>bthenxty fsub fs,fa,fb (4,5) 
else x € z fsel fx,fs,fy,fz 
if a > b then x ty fsub fs,fb,fa (3,4,5) 
else x € z fsel fx,fs,fz,fy 
ifa=bthenxty fsub fs,fa,fb (4,5) 
else x € z fsel fx,fs,fy,fz 
fneg fs,fs 
fsel fx,fs,fx,fz 


x, y, and z are floating-point variables, which are 
assumed to be in FPRs fa, fb, fx, fy, and fz. FPR fs is 
assumed to be available for scratch space. 


Additional examples can be found in Section C.2, 
“Floating-Point Conversions” on page 158. 


Warning: Care must be taken in using fsel if IEEE com- 
patibility is required, or if the values being tested can 
be NaNs or infinities; see Section C.3.4. 


C.3.4 Notes 


The following Notes apply to the preceding examples 
and to the corresponding cases using the other three 
arithmetic relations (<, <, and #). They should also 
be considered when any other use of fsel is contem- 
plated. 


In these Notes, the “optimized program" is the 
PowerPC AS program shown, and the “unoptimized 
program” (not shown) is the corresponding PowerPC 
AS program that uses fempu and Branch Conditional 
instructions instead of fsel. 


1. The unoptimized program affects the VXSNAN bit 
of the FPSCR, and therefore may cause the 
system error handler to be invoked if the corre- 
sponding exception is enabled, while the opti- 
mized program does not affect this bit. This 
property of the optimized program is incompat- 
ible with the IEEE standard. 


2. The optimized program gives the incorrect result 
if a is a NaN. 


3. The optimized program gives the incorrect result 
if a and/or b is a NaN (except that it may give the 
correct result in some cases for the minimum and 
maximum functions, depending on how those 
functions are defined to operate on NaNs). 


4. The optimized program gives the incorrect result 
if a and b are infinities of the same sign. (Here it 
is assumed that Invalid Operation Exceptions are 
disabled, in which case the result of the sub- 
traction is a NaN. The analysis is more compli- 
cated if Invalid Operation Exceptions are enabled, 
because in that case the target register of the 
subtraction is unchanged.) 


5. The optimized program affects the OX, UX, XX, 
and VXISI bits of the FPSCR, and therefore may 
cause the system error handler to be invoked if 
the corresponding exceptions are enabled, while 
the unoptimized program does not affect these 
bits. This property of the optimized program is 
incompatible with the IEEE standard. 
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Appendix D. Cross-Reference for Changed POWER 


Mnemonics 


The following table lists the POWER instruction mne- 
monics that have been changed in the PowerPC AS 
Architecture, sorted by POWER mnemonic. 


To determine the PowerPC AS mnemonic for one of 
these POWER mnemonics, find the POWER mnemonic 
in the second column of the table: the remainder of 
the line gives the PowerPC AS mnemonic and the 
page or Book in which the instruction is described, as 
well as the instruction names. A page number is 
shown for instructions that are defined in this Book 


Architecture), and the Book number is shown for 
instructions that are defined in other Books (Book Il, 
PowerPC AS Virtual Environment Architecture, and 
Book Ill, PowerPC AS Operating Environment Archi- 
tecture). If an instruction is defined in more than one 
of these Books, the lowest-numbered Book is used. 


POWER mnemonics that have not changed are not 
listed. POWER instruction names that are the same in 
PowerPC AS are not repeated; i.e., for these, the last 
column of the table is blank. 


(Book |, PowerPC AS User Instruction Set 
Bk | Mnemonic Instruction Mnemonic Instruction 
51 ja[o][.] Add addc[o][.] Add Carrying 
52 |ae[o][.] Add Extended adde[0][.] 
50 Jai Add Immediate addic Add Immediate Carrying 
50 |ai. Add Immediate and Record addic. Add Immediate Carrying and Record 
52 lame[o][.] Add To Minus One Extended addme[o][.] |Add to Minus One Extended 
62 |andil. AND Immediate Lower andi. AND Immediate 
62 |andiu. AND Immediate Upper andis. AND Immediate Shifted 
53 |aze[o][.] Add To Zero Extended addze[o][.] |Add to Zero Extended 
24 |bcc[!] Branch Conditional to Count Register |bcctr[I] 
24 bcr[l] Branch Conditional to Link Register  |bclr[l] 
49 [cal Compute Address Lower addi Add Immediate 
49 |cau Compute Address Upper addis Add Immediate Shifted 
50 |cax[o][.] Compute Address add[0][.] Add 
67 |cntlz[.] Count Leading Zeros cntlzw[.] Count Leading Zeros Word 
Il | dclz Data Cache Line Set to Zero dcbz Data Cache Block set to Zero 
II | dcs Data Cache Synchronize sync Synchronize 
66 |exts[.] Extend Sign extsh[.] Extend Sign Halfword 
106 |fa[.] Floating Add fadd[.] 
107 |fd[.] Floating Divide fdiv[.] 
107 |fm[.] Floating Multiply fmul[.] 
108 |fma[.] Floating Multiply-Add fmadd[.] 
108 |fms[.] Floating Multiply-Subtract fmsub[.] 
109 |fnma[.] Floating Negative Multiply-Add fnmadd[.] 
109 |fnms[.] Floating Negative Multiply-Subtract |fnmsub[.] 
106 |fs[.] Floating Subtract fsub[ .] 
ll [ics Instruction Cache Synchronize isync Instruction Synchronize 
35 || Load Iwz Load Word and Zero 
42 |lbrx Load Byte-Reverse Indexed Iwbrx Load Word Byte-Reverse Indexed 
44 |Im Load Multiple Imw Load Multiple Word 
46 |lsi Load String Immediate Iswi Load String Word Immediate 
46 |lsx Load String Indexed Iswx Load String Word Indexed 
35 |lu Load with Update Iwzu Load Word and Zero with Update 
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Page / POWER PowerPC AS 
Bk | Mnemonic Instruction Mnemonic Instruction 
35 | lux Load with Update Indexed Iwzux Load Word and Zero with Update 
Indexed 
35 |lx Load Indexed Iwzx Load Word and Zero Indexed 
Ill | mtsri Move To Segment Register Indirect |mtsrin 

54 |muli Multiply Immediate mulli Multiply Low Immediate 

54 |muls[o][.] Multiply Short mullw[o][.] Multiply Low Word 

63 loril OR Immediate Lower ori OR Immediate 

63 |oriu OR Immediate Upper oris OR Immediate Shifted 

73 |rlimi[.] Rotate Left Immediate Then Mask rlwimi[.] Rotate Left Word Immediate then 
Insert Mask Insert 

70 |rlinm[.] Rotate Left Immediate Then AND rlwinm[.] Rotate Left Word Immediate then 
With Mask AND with Mask 

72 |rinm[.] Rotate Left Then AND With Mask rlwnm[.] Rotate Left Word then AND with 


51 |sf[o][.] 
52 |sfe[o][.] 
51 |sfi 

52 |sfme[o][.] 
53 |sfze[o][.] 
74 |sl[.] 

75 |sr[.] 

77 |sra[.] 

76 |srai[.] 


40 |st 
43 |stbrx 
44 |stm 
47 |stsi 
47 |stsx 
40 |stu 
40 |stux 
40 |stx 
25 |svca 
61 t 
60 ¡ti 

III tlbi 
63 |xoril 
63 |xoriu 


Subtract From 

Subtract From Extended 

Subtract From Immediate 

Subtract From Minus One Extended 
Subtract From Zero Extended 

Shift Left 

Shift Right 

Shift Right Algebraic 

Shift Right Algebraic Immediate 


Store 

Store Byte-Reverse Indexed 
Store Multiple 

Store String Immediate 
Store String Indexed 
Store with Update 

Store with Update Indexed 
Store Indexed 

Supervisor Call 

Trap 

Trap Immediate 

TLB Invalidate Entry 

XOR Immediate Lower 
XOR Immediate Upper 


subfc[o][.] 
subfe[o][.] 
subfic 
subfme[o][.] 
subfze[0][.] 
slw[.] 

srw[.] 
sraw[.] 
srawi[.] 


stw 
stwbrx 
stmw 
stswi 
Stswx 
stwu 
stwux 
stwx 
sc 

tw 

twi 
tlbie 
xori 
xoris 


Mask 
Subtract From Carrying 


Subtract From Immediate Carrying 


Shift Left Word 

Shift Right Word 

Shift Right Algebraic Word 

Shift Right Algebraic Word Imme- 
diate 

Store Word 

Store Word Byte-Reverse Indexed 
Store Multiple Word 

Store String Word Immediate 
Store String Word Indexed 

Store Word with Update 

Store Word with Update Indexed 
Store Word Indexed 

System Call 

Trap Word 

Trap Word Immediate 


XOR Immediate 
XOR Immediate Shifted 
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Appendix E. Incompatibilities with the POWER Architecture 


This appendix identifies the known incompatibilities 
that must be managed in the migration from the 
POWER Architecture to the PowerPC AS Architecture. 
Some of the incompatibilities can, at least in principle, 
be detected by the processor, which could trap and 
let software simulate the POWER operation. Others 
cannot be detected by the processor even in prin- 
ciple. 


In general, the incompatibilities identified here are 
those that affect a POWER application program; 


E.1 New Instructions, Formerly 
Privileged Instructions 


Instructions new to PowerPC AS typically use opcode 
values (including extended opcode) that are illegal in 
POWER. A few instructions that are privileged in 
POWER (e.g., dclz, called dcbz in PowerPC AS) have 
been made nonprivileged in PowerPC AS. Any 
POWER program that executes one of these now-valid 
or now-nonprivileged instructions, expecting to cause 
the system illegal instruction error handler or the 
system privileged instruction error handler to be 
invoked, will not execute correctly on PowerPC AS. 


E.2 Newly Privileged Instructions 


The following instructions are nonprivileged in POWER 
but privileged in PowerPC AS. 


mfmsr 
mfsr 


E.3 Reserved Bits in Instructions 


These are shown with "/"s in the instruction layouts. 
In POWER such bits are ignored by the processor. In 
PowerPC AS they must be O or the instruction form is 
invalid. 


In several cases the PowerPC AS Architecture 
assumes that such bits in POWER instructions are 
indeed 0. The cases include the following. 


incompatibilities for instructions that can be used only 
by POWER system programs are not necessarily dis- 
cussed. 


References to instructions and facilities that are not 
defined in Book |, PowerPC AS User Instruction Set 
Architecture, apply to an implementation that con- 
forms to Book ll, PowerPC AS Virtual Environment 
Architecture, and Book lll, PowerPC AS Operating 
Environment Architecture. 


m bclr[!] and bectr[]] assume that bits 19:20 in the 
POWER instructions are 0. 

m cmpi, cmp, cmpli, and cmpl assume that bit 10 in 
the POWER instructions is 0. 

m mtspr and mfspr assume that bits 16:20 in the 
POWER instructions are 0. 

m micrf and mfcr assume that bit 11 in the POWER 
instructions is 0. 

m sync assumes that bit 10 in the POWER instruc- 
tion (dcs) is O. 


E.4 Reserved Bits in Registers 


Both POWER and PowerPC AS permit software to 
write any value to these bits. However in POWER 
reading such a bit always returns 0, while in PowerPC 
AS reading it may return either O or the value that 
was last written to it. 


E5 Alignment Check 


The POWER MSR AL bit (bit 24) is no longer sup- 
ported; the corresponding PowerPC AS MSR bit, bit 
56, is reserved. The low-order bits of the EA are 
always used. (Notice that the value 0 — the normal 
value for a reserved bit — means "ignore the low- 
order EA bits" in POWER, and the value 1 means "use 
the low-order EA bits”.) POWER-compatible operating 
system code will probably write the value 1 to this bit. 
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E.6 Condition Register 


The following instructions specify a field in the CR 
explicitly (via the BF field) and also, in POWER, use bit 
31 as the Record bit. In PowerPC AS, if bit 31 = 1 for 
these instructions the instruction form is invalid. In 
POWER, if Rc=1 the instructions execute normally 
except as follows: 


cmp CRO is undefined if Rc=1 and BF+0 
cmpl CRO is undefined if Rc=1 and BF+0 
merxr CRO is undefined if Rc=1 and BF+0 
fcmpu CR1 is undefined if Rc=1 
fcmpo CR1 is undefined if Rc=1 
merfs CR1 is undefined if Rc=1 and BFz 1 


E.7 Inappropriate Use of LK and 
Re Bits 


For the instructions listed below, if bit 31 (LK or Rc bit 
in POWER) is set to 1, POWER executes the instruction 
normally with the exception of setting the Link Reg- 
ister (if LK=1) or Condition Register Field O or 1 (if 
Rc=1) to an undefined value. In PowerPC AS such 
instruction forms are invalid. 


PowerPC AS instructions that are invalid form if bit 31 
= 1 (LK bit in POWER): 


sc (sve in POWER) 

the Condition Register Logical instructions 
merf 

isync (ics in POWER) 


PowerPC AS instructions that are invalid form if bit 31 
= 1 (Rc bit in POWER): 


fixed-point X-form Load and Store instructions 
fixed-point X-form Compare instructions 

the X-form Trap instruction 

mtspr, mfspr, mtcrf, mcrxr, mfcr 

floating-point X-form Load and Store instructions 
floating-point Compare instructions 

mcrfs 

dcbz (dclz in POWER) 


ES BO Field 


POWER shows certain bits in the BO field — used by 
Branch Conditional instructions — as “x”. Although 
the POWER Architecture does not say how these bits 
are to be interpreted, they are in fact ignored by the 
processor. 

PowerPC AS shows these bits as “z”, “a”, or “t”. The 
“z” bits are ignored, as in POWER. However, the “a” 


and “t” bits can be used by software to provide a hint 
about how the branch is likely to behave. If a POWER 
program has the “wrong” value for these bits, the 
program will produce the same results as on POWER 
but performance may be affected. 


E.9 BH Field 


Bits 19:20 of the Branch Conditional to Link Register 
and Branch Conditional to Count Register instructions 
are reserved in POWER but are defined as a branch 
hint (BH) field in PowerPC AS. Because these bits are 
hints, they may affect performance but do not affect 
the results of executing the instruction. 


E.10 Branch Conditional to 
Count Register 


For the case in which the Count Register is decre- 
mented and tested (i.e., the case in which BOp=0), 
POWER specifies only that the branch target address 
is undefined, with the implication that the Count Reg- 
ister, and the Link Register if LK=1, are updated in 
the normal way. PowerPC AS specifies that this 
instruction form is invalid. 


E.11 System Call 


There are several respects in which PowerPC AS is 
incompatible with POWER for System Call instructions 
— which in POWER are called Supervisor Call 
instructions. 


m POWER provides a version of the Supervisor Call 
instruction (bit 30 = 0) that allows instruction 
fetching to continue at any one of 128 locations. 
It is used for “fast SVCs”. PowerPC AS provides 
no such version: if bit 30 of the instruction is O 
the instruction form is invalid. 


m POWER provides a version of the Supervisor Call 
instruction (bits 30:31 = 0b11) that resumes 
instruction fetching at one location and sets the 
Link Register to the address of the next instruc- 
tion. PowerPC AS provides no such version: if bit 
31 of the instruction is 1 the instruction form is 
invalid. 


m For POWER, information from the MSR is saved in 
the Count Register. For PowerPC AS this infor- 
mation is saved in SRR1. 


m POWER permits bits 16:19 and 27:29 of the 
instruction to be nonzero, while in PowerPC AS 
such an instruction form is invalid. 
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Architecture and Engineering Note 


Bits 16:19 and 27:29 should be regarded as 
reserved for POWER. As long as POWER 
compatibility is required for this instruction, 
bits 16:19 and 27:29 should be ignored by the 
processor. 


m POWER saves the low-order 16 bits of the instruc- 
tion, in the Count Register. PowerPC AS does not 
save them. 


m The settings of MSR bits by the associated inter- 
rupt differ between POWER and PowerPC AS; see 
POWER Processor Architecture and Book Ill, 
PowerPC AS Operating Environment Architecture. 


E.12 Fixed-Point Exception 
Register (XER) 


Bits 48:55 of the XER are reserved in PowerPC AS, 
while in POWER the corresponding bits (16:23) are 
defined and contain the comparison byte for the Iscbx 
instruction (which PowerPC AS lacks). 


Engineering Note 


For reasons of compatibility with the POWER 
Architecture, early implementations must set XER 
bits 48:55 from the source value on write, and 
return the value last written to them on read. 


E.13 Update Forms of Storage 
Access Instructions 


PowerPC AS requires that RA not be equal to either 
RT (fixed-point Load only) or O. If the restriction is 
violated the instruction form is invalid. POWER 
permits these cases, and simply avoids saving the 
EA. 


E.14 Multiple Register Loads 


PowerPC AS requires that RA, and RB if present in 
the instruction format, not be in the range of registers 
to be loaded, while POWER permits this and does not 
alter RA or RB in this case. (The PowerPC AS 
restriction applies even if RA=0, although there is no 
obvious benefit to the restriction in this case since RA 
is not used to compute the effective address if 
RA=0.) If the PowerPC AS restriction is violated, 
either the system illegal instruction error handler is 
invoked or the results are boundedly undefined. The 
instructions affected are: 


Imw (Im in POWER) 
Iswi (Isi in POWER) 
Iswx (Isx in POWER) 


For example, an /mw instruction that loads all 32 reg- 
isters is valid in POWER but is an invalid form in 
PowerPC AS. 


E.15 Load/Store Multiple 
Instructions 


There are several respects in which PowerPC AS is 
incompatible with POWER for Load Multiple and Store 
Multiple instructions. 


= lí the EA is not word-aligned, in PowerPC AS 
either an Alignment interrupt occurs or the 
results are boundedly undefined, while in POWER 
an Alignment interrupt occurs if MSRa¡=1 (the 
low-order two bits of the EA are ignored if 
MSRa¡=0). 


Engineering Note 


If attempt is made to execute an Imw or stmw 
instruction having an incorrectly aligned 
effective address, early implementations must 
either correctly transfer the addressed bytes 
or cause an Alignment interrupt, for reasons 
of compatibility with the POWER Architecture. 


m In PowerPC AS the instruction may be interrupted 
by a system-caused interrupt, while in POWER the 
instruction cannot be thus interrupted. 


E.16 Move Assist Instructions 


There are several respects in which PowerPC AS is 
incompatible with POWER for Move Assist 
instructions. 


m In PowerPC AS an Iswx instruction with zero 
length leaves the contents of RT undefined (if 
RT#RA and RT#RB) or is an invalid instruction 
form (if RT=RA or RT=RB), while in POWER the 
corresponding instruction (Isx) is a no-op in these 
cases. 


m In PowerPC AS an Iswx instruction with zero 
length may alter the Reference bit, and a stswx 
instruction with zero length may alter the Refer- 
ence and Change bits, while in POWER the corre- 
sponding instructions (Isx and stsx) do not alter 
the Reference and Change bits in this case. 


= In PowerPC AS a Move Assist instruction may be 
interrupted by a system-caused interrupt, while in 
POWER the instruction cannot be thus inter- 
rupted. 
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E.17 Move To/From SPR 


There are several respects in which PowerPC AS is 
incompatible with POWER for Move To/From Special 
Purpose Register instructions. 


m The SPR field is ten bits long in PowerPC AS, but 
only five in POWER (see also Section E3, 
“Reserved Bits in Instructions” on page 163). 


m mfspr can be used to read the Decrementer in 
problem state in POWER, but only in privileged 
state in PowerPC AS. 


m lí the SPR value specified in the instruction is not 
one of the defined values, POWER behaves as 
follows. 


— |f the instruction is executed in problem state 
and SPRç=1, a Privileged Instruction type 
Program interrupt occurs. No architected 
registers are altered except those set by the 
interrupt. 


— Otherwise no architected registers are 
altered. 


In this same case, PowerPC AS behaves as 
follows. 


— If the instruction is executed in problem state 
and spro=1, either an Illegal Instruction type 
Program interrupt or a Privileged Instruction 
type Program interrupt occurs. No archi- 
tected registers are altered except those set 
by the interrupt. 


— Otherwise either an Illegal Instruction type 
Program interrupt occurs (in which case no 
architected registers are altered except those 
set by the interrupt) or the results are 
boundedly undefined (or possibly undefined, 
for mtspr; see Book III). 


Engineering Note 


For reasons of compatibility with the POWER 
Architecture, early implementations must cause 
an Illegal Instruction type Program interrupt for an 
attempt to execute an mtspr or mfspr instruction 
with SPR=0 (sprg4- 0 denotes the POWER MQ 
register). 


Similarly, early implementations must cause an 
Illegal Instruction type Program interrupt for an 
attempt to execute an mfspr instruction with 
SPR=4 (spro.4=4 denotes reading the Real-Time 
Clock Upper in POWER) SPR=5 (sprg4-5 
denotes reading the Real-Time Clock Lower in 
POWER), or SPR=6 (sprg,4- 6 denotes reading the 
Decrementer in POWER). 


Essentially all POWER programs are expected to 
have bits 16:20 of mtspr and mfspr instructions 
set to 0. These bits correspond to PowerPC AS's 
Sprgg, and are reserved bits in POWER. The 
requirements described in this Note provide com- 
patibility only for POWER programs that have 
these bits set to 0. 


E.18 Effects of Exceptions on 
FPSCR Bits FR and FI 


For the following cases, POWER does not specify how 
FR and Fl are set, while PowerPC AS preserves them 
for Invalid Operation Exception caused by a Compare 
instruction, sets Fl to 1 and FR to an undefined value 
for disabled Overflow Exception, and clears them oth- 
erwise. 


m Invalid Operation Exception (enabled or disabled) 
m Zero Divide Exception (enabled or disabled) 
m Disabled Overflow Exception 


E.19 Store Floating-Point Single 
Instructions 


There are several respects in which PowerPC AS is 
incompatible with POWER for Store Floating-Point 
Single instructions. 


m POWER uses FPSCRyg to help determine whether 
denormalization should be done, while PowerPC 
AS does not. Using FPSCRyg is in fact incorrect: 
if FPSCRyg=1 and a denormalized single- 
precision number is copied from one storage 
location to another by means of Ifs followed by 
stfs, the two “copies” may not be the same. 


m For an operand having an exponent that is less 
than 874 (unbiased exponent less than — 149), 
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POWER stores a zero (if FPSCRyç=0) while 
PowerPC AS stores an undefined value. 


E.20 Move From FPSCR 


POWER defines the high-order 32 bits of the result of 
mffs to be OxFFFF_FFFF, while PowerPC AS specifies 
that they are undefined. 


E.21 Zeroing Bytes in the Data 
Cache 


The deliz instruction of POWER and the dcbz instruc- 
tion of PowerPC AS have the same opcode. However, 
the functions differ in the following respects. 


m dciz clears a line while dcbz clears a block. 

m dciz saves the EA in RA (if RA#0) while dcbz 
does not. 

m dclz is privileged while dcbz is not. 


E22 Synchronization 


The sync instruction (called dcs in POWER) and the 
isync instruction (called ics in POWER) cause more 
pervasive synchronization in PowerPC AS than in 
POWER. However, unlike dcs, sync does not wait until 
data cache block writes caused by preceding 
instructions have been performed in main storage. 
Also, sync has an L field while dcs does not. 


E23 Direct-Store Segments 


POWER's direct-store segments are not supported in 
PowerPC AS. 


E.24 Segment Register 
Manipulation Instructions 


The definitions of the four Segment Register Manipu- 
lation instructions mtsr, mtsrin, mfsr, and mfsrin differ 
in two respects between POWER and PowerPC AS. 
Instructions similar to mtsrin and mfsrin are called 
mtsri and mfsri in POWER. 


privilege: mfsr and  mfsri are problem state 
instructions in POWER, while mfsr and 
mfsrin are privileged in PowerPC AS. 


function: the "indirect" instructions (mtsri and 
mfsri) in POWER use an RA register in 
computing the Segment Register number, 
and the computed EA is stored into RA (if 
RAz0 and RAz RT), while in PowerPC AS 
mtsrin and mfsrin have no RA field and 
the EA is not stored. 


mtsr, mtsrin (mtsri), and mfsr have the same opcodes 
in PowerPC AS as in POWER. mfsri (POWER) and 
mfsrin (PowerPC AS) have different opcodes. 


Also, the Segment Register Manipulation instructions 
are required in POWER whereas they are optional in 
PowerPC AS. 


E.25 TLB Entry Invalidation 


The tlbi instruction of POWER and the tlbie instruction 
of PowerPC AS have the same opcode. However, the 
functions differ in the following respects. 


m tlbi computes the EA as (RAJO) + (RB), while 
tlbie lacks an RA field and computes the EA and 
related information as (RB). 

m tlbi saves the EA in RA (if RA#0), while tlbie 
lacks an RA field and does not save the EA. 

m For tlbi the high-order 36 bits of RB are used in 
computing the EA, while for tlbie these bits 
contain additional information that is not directly 
related to the EA. 

m tibie has an L field, while tlbi does not. 


Also, tlbi is required in POWER whereas tlbie is 
optional in PowerPC AS. 


E26 Alignment Interrupts 


Placing information about the interrupting instruction 
into the DSISR and the DAR when an Alignment inter- 
rupt occurs is optional in PowerPC AS but required in 
POWER. 


E27 Floating-Point Interrupts 


POWER uses MSR bit 20 to control the generation of 
interrupts for floating-point enabled exceptions, and 
PowerPC AS uses the corresponding MSR bit, bit 52, 
for the same purpose. However, in PowerPC AS this 
bit is part of a two-bit value that controls the occur- 
rence, precision, and recoverability of the interrupt, 
while in POWER this bit is used independently to 
control the occurrence of the interrupt (in POWER all 
floating-point interrupts are precise). 
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E.28 Timing Facilities 


E.28.1 Real-Time Clock 


The POWER Real-Time Clock is not supported in 
PowerPC AS. Instead, PowerPC AS provides a Time 
Base. Both the RTC and the TB are 64-bit Special 
Purpose Registers, but they differ in the following 
respects. 


m The RTC counts seconds and nanoseconds, while 
the TB counts “ticks”. The ticking rate of the TB 
is implementation-dependent. 

m The RTC increments discontinuously: 1 is added 
to RTCU when the value in RTCL passes 
999 999 999. The TB increments continuously: 1 
is added to TBU when the value in TBL passes 
OxFFFF_FFFF. 

m The RTC is written and read by the mispr and 
mfspr instructions, using SPR numbers that 
denote the RTCU and RTCL. The TB is written by 
the mtspr instruction (using new SPR numbers), 
and read by the new mftb instruction. 

= The SPR numbers that denote POWER's RTCL 
and RTCU are invalid in PowerPC AS. 

m The RTC is guaranteed to increment at least once 
in the time required to execute ten Add Imme- 
diate instructions. No analogous guarantee is 
made for the TB. 

m Not all bits of RTCL need be implemented, while 
all bits of the TB must be implemented. 


E.28.2 Decrementer 


The PowerPC AS Decrementer differs from the 
POWER Decrementer in the following respects. 


m The PowerPC AS DEC decrements at the same 
rate that the TB increments, while the POWER 
DEC decrements every nanosecond (which is the 
same rate that the RTC increments). 

= Not all bits of the POWER DEC need be imple- 
mented, while all bits of the PowerPC AS DEC 
must be implemented. 

m The interrupt caused by the DEC has its own 
interrupt vector location in PowerPC AS, but is 
considered an External interrupt in POWER. 


E.29 Deleted Instructions 


The following instructions are part of the POWER 
Architecture but have been dropped from the 
PowerPC AS Architecture. 


abs Absolute 

cles Cache Line Compute Size 

clf Cache Line Flush 

cli (*) Cache Line Invalidate 

dclst Data Cache Line Store 

div Divide 

divs Divide Short 

doz Difference Or Zero 

dozi Difference Or Zero Immediate 

Iscbx Load String And Compare Byte Indexed 
maskg Mask Generate 

maskir Mask Insert From Register 

mfsri Move From Segment Register Indirect 
mul Multiply 

nabs Negative Absolute 

rac (*) Real Address Compute 

rfi (*) Return From Interrupt 

rfsvc (*) Return From SVC 

rimi Rotate Left Then Mask Insert 

rrib Rotate Right And Insert Bit 

sle Shift Left Extended 

sleq Shift Left Extended With MQ 

sliq Shift Left Immediate With MQ 

slliq Shift Left Long Immediate With MQ 
silg Shift Left Long With MQ 

slq Shift Left With MQ 

sraiq Shift Right Algebraic Immediate With MQ 
sraq Shift Right Algebraic With MQ 

sre Shift Right Extended 

srea Shift Right Extended Algebraic 

sreq Shift Right Extended With MQ 

sriq Shift Right Immediate With MQ 

srliq Shift Right Long Immediate With MQ 
srlq Shift Right Long With MQ 

srq Shift Right With MQ 


(*) This instruction is privileged. 


Note: Many of these instructions use the MQ reg- 
ister. The MQ is not defined in the PowerPC AS 
Architecture. 
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E.30 Discontinued Opcodes 


The opcodes listed below are defined in the POWER 
Architecture but have been dropped from the 
PowerPC AS Architecture. The list contains the 
POWER mnemonic (MNEM), the primary opcode (PRI), 
and the extended opcode (XOP) if appropriate. The 
corresponding instructions are reserved in PowerPC 
AS. 


MNEM PRI XOP 
abs 31 360 
cles 31 531 
cif 31 118 
cli (*) 31 502 
dclst 31 630 
div 31 331 
divs 31 363 
doz 31 264 
dozi 09 = 
Iscbx 31 277 
maskg 31 29 
maskir 31 541 
mfsri 31 627 
mul 31 107 
nabs 31 488 
rac (*) 31 818 
rfi (*) 19 50 
rfsvc (*) 19 82 
rimi 22 — 
rrib 31 537 
sle 31 153 
sleq 31 217 
sliq 31 184 
slliq 31 248 
sllq 31 216 
slq 31 152 
sraiq 31 952 
sraq 31 920 
sre 31 665 
srea 31 921 
sreq 31 729 
sriq 31 696 
srliq 31 760 
sriq 31 728 
srq 31 664 


(*) This instruction is privileged. 


Assembler Note 


It might be helpful to current software writers for 
the Assembler to flag the discontinued POWER 
instructions. 


Engineering Note 


The instructions listed above are reserved in the 
PowerPC AS Architecture. For reasons of compat- 
ibility with the POWER Architecture, early imple- 
mentations must cause an Illegal Instruction type 
Program interrupt for an attempt to execute any 
of these instructions that are not privileged. 


Appendix E. Incompatibilities with the POWER Architecture 169 


IBM Confidential - Feb. 24, 1999 


E.31 POWER2 Compatibility 


The POWER2 instruction set is a superset of the 
POWER instruction set. Some of the instructions 
added for POWER2 are included in the PowerPC AS 
Architecture. Those that have been renamed in the 
PowerPC AS Architecture are listed in this section, as 


are the new POWER2 instructions that are not 
included in the PowerPC AS Architecture. 


Other incompatibilities are also listed. 


E.31.1 Cross-Reference for Changed POWER2 Mnemonics 


The following table lists the new POWER2 instruction 
mnemonics that have been changed in the PowerPC 
AS User Instruction Set Architecture, sorted by 
POWER2 mnemonic. 


To determine the PowerPC AS mnemonic for one of 
these POWER2 mnemonics, find the POWER2 mne- 


monic in the second column of the table: the 
remainder of the line gives the PowerPC AS mne- 
monic and the page on which the instruction is 
described, as well as the instruction names. 


POWER2 mnemonics that have not changed are not 
listed. 


POWER2 


Mnemonic 


112 |fcir[.] 
with Round 
112 |fcirz[.] 
with Round to Zero 


Floating Convert Double to Integer 


Floating Convert Double to Integer 


PowerPC AS 
Instruction 


fctiw[.] Floating Convert To Integer Word 


fctiwz[.] Floating Convert To Integer Word 


with round toward Zero 


E.31.2 Floating-Point Conversion to 
Integer 


The fcir and fcirz instructions of POWER2 have the 
same opcodes as do the fctiw and fctiwz instructions, 
respectively, of PowerPC AS. However, the functions 
differ in the following respects. 


m fcir and fcirz set the high-order 32 bits of the 
target FPR to OxFFFF FFFF, while fctiw and fctiwz 
set them to an undefined value. 

m Except for enabled Invalid Operation Exceptions, 
fcir and fcirz set the FPRF field of the FPSCR 
based on the result, while fctiw and fctiwz set it 
to an undefined value. 

m fcir and fcirz do not affect the VXSNAN bit of the 
FPSCR, while fctiw and fctiwz do. 

m fcir and fcirz set FPSCRyy to 1 for certain cases 
of "Large Operands" (i.e., operands that are too 
large to be represented as a 32-bit signed fixed- 
point integer), while fctiw and fctiwz do not alter 
it for any case of "Large Operand”. (The IEEE 
standard requires not altering it for "Large Oper- 
ands".) 


E.31.3 Storage Access Ordering 


POWER2 uses MSR bit 28 to control storage access 
ordering; the corresponding PowerPC AS MSR bit, bit 
60, is reserved, and no corresponding control is pro- 
vided. 


E.31.4 Floating-Point Interrupts 


POWER2 uses MSR bits 20 and 23 to control the gen- 
eration of interrupts for floating-point enabled 
exceptions, and PowerPC AS uses the corresponding 
MSR bits, bits 52 and 55, for the same purpose. 
However, in PowerPC AS these bits comprise a 
two-bit value that controls the occurrence, precision, 
and recoverability of the interrupt, while in POWER2 
these bits are used independently to control the 
occurrence (bit 20) and the precision (bit 23) of the 
interrupt. Moreover, in PowerPC AS all floating-point 
interrupts are considered Program interrupts, while in 
POWER2 imprecise floating-point interrupts have their 
own interrupt vector location. 


E.31.5 Trace 


The Trace interrupt vector location differs between 
the two architectures, and there are many other dif- 
ferences. 
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E.31.6 Deleted Instructions 


The following instructions are new in POWER2 imple- 
mentations of the POWER Architecture but have been 
dropped from the PowerPC AS Architecture. 


Ifq Load Floating-Point Quad 

Ifqu Load Floating-Point Quad with Update 

Ifqux Load Floating-Point Quad with Update 
Indexed 

Ifqx Load Floating-Point Quad Indexed 

stfq Store Floating-Point Quad 

stfqu Store Floating-Point Quad with Update 

stfqux Store Floating-Point Quad with Update 
Indexed 

stfqx Store Floating-Point Quad Indexed 


E.31.7 Discontinued Opcodes 


The opcodes listed below are new in POWER2 imple- 
mentations of the POWER Architecture but have been 
dropped from the PowerPC AS Architecture. The list 
contains the POWER2 mnemonic (MNEM), the primary 
opcode (PRI), and the extended opcode (XOP) if 
appropriate. The corresponding instructions are 
reserved in PowerPC AS. 


MNEM PRI XOP 
Ifq 56 = 
Ifqu 57 = 
Ifqux 31 823 
Ifqx 31 791 
stfq 60 — 
stfqu 61 — 
stfqux 31 951 
stfqx 31 919 


Engineering Note 


The instructions listed above are reserved in the 
PowerPC AS Architecture. For reasons of compat- 
ibility with POWER2 implementations of the 
POWER Architecture, early implementations must 
cause an Illegal Instruction type Program interrupt 
for an attempt to execute any of these 
instructions. 
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Appendix F. New Instructions 


The following instructions in the PowerPC AS User 
Instruction Set Architecture are new; they are not in 
the POWER Architecture. 


The following instructions are optional: fres, frsqrte, 
fsel, fsqrt[ s]. 


cntizd 
divd 
divdu 
divw 
divwu 
extsb 
extsw 
fadds 
fcfid 


fctid 
fctidz 


fctiw 
fctiwz 


fdivs 
fmadds 
fmsubs 
fmuls 
fnmadds 
fnmsubs 
fres 
frsqrte 
fsel 
fsqri| s] 
fsubs 

ld 


Count Leading Zeros Doubleword 
Divide Doubleword 

Divide Doubleword Unsigned 
Divide Word 

Divide Word Unsigned 
Extend Sign Byte 
Extend Sign Word 
Floating Add Single 
Floating Convert 
Doubleword 

Floating Convert To Integer Doubleword 
Floating Convert To Integer Doubleword 
with round toward Zero 

Floating Convert To Integer Word 

Floating Convert To Integer Word with 
round toward Zero 

Floating Divide Single 

Floating Multiply-Add Single 

Floating Multiply-Subtract Single 

Floating Multiply Single 

Floating Negative Multiply-Add Single 
Floating Negative Multiply-Subtract Single 
Floating Reciprocal Estimate Single 
Floating Reciprocal Square Root Estimate 
Floating Select 

Floating Square Root [Single] 

Floating Subtract Single 

Load Doubleword 


From Integer 


Idu 
Idux 
Idx 

lwa 
Iwaux 
Iwax 
mulhd 
mulhdu 
mulhw 
mulhwu 
mulid 
ridel 
rider 
ridic 


ridicl 


ridicr 


rldimi 


sid 
srad 
sradi 


srd 
sid 
stdu 
stdux 
stdx 
stfiwx 


subf 
td 
tdi 


Load Doubleword with Update 

Load Doubleword with Update Indexed 
Load Doubleword Indexed 

Load Word Algebraic 

Load Word Algebraic with Update Indexed 
Load Word Algebraic Indexed 

Multiply High Doubleword 

Multiply High Doubleword Unsigned 
Multiply High Word 

Multiply High Word Unsigned 

Multiply Low Doubleword 

Rotate Left Doubleword then Clear Left 
Rotate Left Doubleword then Clear Right 
Rotate Left Doubleword Immediate then 
Clear 

Rotate Left Doubleword Immediate then 
Clear Left 

Rotate Left Doubleword Immediate then 
Clear Right 

Rotate Left Doubleword Immediate then 
Mask Insert 

Shift Left Doubleword 

Shift Right Algebraic Doubleword 

Shift Right Algebraic Doubleword Imme- 
diate 

Shift Right Doubleword 

Store Doubleword 

Store Doubleword with Update 

Store Doubleword with Update Indexed 
Store Doubleword Indexed 
Store Floating-Point as 
Indexed 

Subtract From 

Trap Doubleword 

Trap Doubleword Immediate 


Integer Word 
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Appendix G. Illegal Instructions 


With the exception of the instruction consisting 
entirely of binary Os, the instructions in this class are 
available for future extensions of the PowerPC AS 
Architecture; that is, some future version of the 
PowerPC AS Architecture may define any of these 
instructions to perform new functions. 


The following primary opcodes are illegal. 
1,4,5,6 


The following primary opcodes have unused extended 
opcodes. Their unused extended opcodes can be 
determined from the opcode maps in Appendix |. All 
unused extended opcodes are illegal. 


19, 30, 31, 56, 57, 58, 59, 60, 61, 62, 63 
An instruction consisting entirely of binary Os is 


illegal, and is guaranteed to be illegal in all future 
versions of this architecture. 
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Appendix H. Reserved Instructions 


The instructions in this class are allocated to specific 
purposes that are outside the scope of the PowerPC 
AS User Instruction Set Architecture, PowerPC AS 
Virtual Environment Architecture, and PowerPC AS 
Operating Environment Architecture. 


The following types of instruction are included in this 
class. 


1. 


The instruction having primary opcode 0, except 
the instruction consisting entirely of binary Os 
(which is an illegal instruction; see Section 1.8.2, 
“Illegal Instruction Class" on page 11) and the 
extended opcode shown below. 


256 Service Processor “Attention” (PowerPC AS 
only) 


Instructions for the POWER Architecture that have 
not been included in the PowerPC AS Architec- 
ture. These are listed in Section E.30, “Discon- 
tinued Opcodes” on page 169 and Section E.31.7, 
“Discontinued Opcodes” on page 171. 


Implementation-specific instructions used to 
conform to the PowerPC AS Architecture specifi- 
cation. 


Any other instructions contained in Book IV, 
PowerPC AS Implementation Features for any 
implementation, that are not defined in the 
PowerPC AS User Instruction Set Architecture, 
PowerPC AS Virtual Environment Architecture, or 
PowerPC AS Operating Environment Architecture. 
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Appendix |. Opcode Maps 


This section contains tables showing the opcodes and 
extended opcodes in all members of the POWER 
architecture family. 


For the primary opcode table (Table 12 on page 181), 
each cell is in the following format. 


Opcode in 
Decimal 


Opcode in 
Hexadecimal 


Instruction 
Mnemonic 


Instruction 
Format 


Applicable 
Machines 


“Applicable Machines” identifies the POWER architec- 
ture family members that recognize the opcode, 
encoded as follows: 


A PowerPC AS 

P PowerPC 

2 | POWER2 

O Original POWER (RS/6000) 
All All of the above 


The extended opcode tables show the extended 
opcode in decimal, the instruction mnemonic, the 
applicable machines, and the instruction format. 
These tables appear in order of primary opcode within 
two groups. The first group consists of the primary 
opcodes that have small extended opcode fields (2-4 
bits), namely 30, 56, 57, 58, 60, 61, and 62. The 
second group consists of primary opcodes that have 
10-bit extended opcode fields. The tables for the 
second group are rotated. 


In the extended opcode tables several special 
markings are used. 


m A prime (’) following an instruction mnemonic 
denotes an additional cell, after the lowest- 
numbered one, used by the instruction. For 
example, subfc occupies cells 8 and 520 of 
primary opcode 31, with the former corresponding 
to OE=0 and the latter to OE=1. Similarly, sradi 
occupies cells 826 and 827, with the former corre- 
sponding to shs=0 and the latter to shs=1 (the 


9-bit extended opcode 413, shown on page 76, 
excludes the shs bit). 


m Two vertical bars (||) are used instead of primed 
mnemonics when an instruction occupies an 
entire column of a table. The instruction mne- 
monic is repeated in the last cell of the column. 


m For primary opcode 31, an asterisk (*) in a cell 
that would otherwise be empty means that the 
cell is reserved because it is “overlaid”, by a 
fixed-point or Storage Access instruction having 
only a primary opcode, by an instruction having 
an extended opcode in primary opcode 30, 58, or 
62, or by a potential instruction in any of the cate- 
gories just mentioned. The overlaying instruc- 
tion, if any, is also shown. A cell thus reserved 
should not be assigned to an instruction having 
primary opcode 31. (The overlaying is a conse- 
quence of opcode decoding for fixed-point 
instructions: the primary opcode, and the 
extended opcode if any, are mapped internally to 
a 10-bit “compressed opcode” for ease of subse- 
quent decoding.) 


m Parentheses around the opcode or extended 
opcode mean that the instruction was defined in 
earlier versions of the PowerPC AS Architecture 
but is no longer defined in the PowerPC AS Archi- 
tecture. 


An empty cell, a cell containing only an asterisk, or a 
cell in which the opcode or extended opcode is paren- 
thesized, corresponds to an illegal instruction. 


When instruction names and/or mnemonics differ 
among the family members, the PowerPC AS/PowerPC 
terminology is used. 


The instruction consisting entirely of binary Os causes 
the system illegal instruction error handler to be 
invoked for all members of the POWER family, and 
this is likely to remain true in future models (it is 
guaranteed in the PowerPC AS Architecture). An 
instruction having primary opcode 0 but not consisting 
entirely of binary Os is reserved except for the fol- 
lowing extended opcode (instruction bits 21:30). 


256 Service Processor “Attention” (PowerPC AS 
only) 
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Engineering Note 


Architecture Note 

Implementation-specific instructions must be privi- The following opcodes are reserved because they 
leged, and must comply with the other guidelines are used in some implementations. 
and limitations given in the Preface of Book |. ; : 
Opcodes for implementation-specific instructions ao Primary opcode ia emendas opcode y! 

pco p p 
must be requested in advance from the person m Primary opcode 31: extended opcodes 131, 
responsible for the technical content of this docu- 163, 262, 274, 308, 323, 451, 454, 486, 914, 
ment (see the cover page). 946, 966, 978, 998, 1010 


These opcodes will not be assigned a meaning in 
the PowerPC AS Architecture except after careful 
consideration of the effect of such assignment on 
existing implementations. The same applies to 
opcodes that are parenthesized in the opcode 
maps. 


180 PowerPC AS User Instruction Set Architecture, Tags Inactive Subset 


IBM Confidential - Feb. 24, 1999 


Table 12. Primary opcodes 


Reserved 
| 


subfic 


rlwimi 


FX 
Extended Ops 
All 


All D|AII D|AII D|AII D 
36 24/37 25138 26/39 27 
stw stwu stb stbu 
All D|AII D|AII D|AII D 
40 28/41 29/42 2A|43 2B 
Ihz Ihzu lha lhau 
All D|AII D|AII D|AII D 
44 2C|45 2D|46 2E |47 2F 
sth sthu Imw stmw 
All DIAll D|AII D|AII D 
48 30/49 31/50 32 |51 33 
Ifs Ifsu Ifd Ifdu 


58 3A 
Ifq q FX DS-form 
3 illegal Loads 
DS 


FP Single 
Extended Ops 
AP AP A 


3 illegal 
DS|2 


62 3E |63 SF 
FX DS-Form FP Double 
Stores Extended Ops 
S|AP DS |All 


stfq, 
3 illegal 
DS|2 


stfqu, 
3 illegal 
D 


See primary opcode 0 extensions on page 179 


Trap Doubleword Immediate 
Trap Word Immediate 


Multiply Low Immediate 


Subtract From Immediate Carrying 
Difference or Zero Immediate 
Compare Logical Immediate 
Compare Immediate 


Add Immediate Carrying 

Add Immediate Carrying and Record 
Add Immediate 

Add Immediate Shifted 


Branch Conditional 
System Call 

Branch 

See Table 20 on page 184 


Rotate Left Word Imm. then Mask Insert 
Rotate Left Word Imm. then AND with Mask 
Rotate Left then Mask Insert 

Rotate Left Word then AND with Mask 


OR Immediate 

OR Immediate Shifted 
XOR Immediate 

XOR Immediate Shifted 


AND Immediate 

AND Immediate Shifted 
See Table 13 on page 182 
See Table 21 on page 186 


Load Word and Zero 
Load Word and Zero with Update 
Load Byte and Zero 
Load Byte and Zero with Update 


Store Word 
Store Word with Update 
Store Byte 
Store Byte with Update 


Load Half and Zero 
Load Half and Zero with Update 
Load Half Algebraic 
Load Half Algebraic with Update 


Store Half 

Store Half with Update 
Load Multiple Word 
Store Multiple Word 


Load Floating-Point Single 
Load Floating-Point Single with Update 
Load Floating-Point Double 
Load Floating-Point Double with Update 


Store Floating-Point Single 
Store Floating-Point Single with Update 
Store Floating-Point Double 
Store Floating-Point Double with Update 


See Table 14 on page 183 
See Table 15 on page 183 
See Table 16 on page 183 
See Table 22 on page 188 


See Table 17 on page 183 
See Table 18 on page 183 
See Table 19 on page 183 
See Table 23 on page 190 
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Table 13. Extended opcodes for primary opcode 30 
(instruction bits 27:30) 


y 


rldimi 
AP 
MD 
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Table 14. Extended opcodes for primary opcode 56 


Table 15. Extended opcodes for primary opcode 57 


(instruction bits 30:31) 


(instruction bits 30:31) 
0 1 0 1 
0 0 
Ifq Ifqu 
0 > y 2 
DS DS 


Table 16. Extended opcodes for primary opcode 58 Table 17. Extended opcodes for primary opcode 60 
(instruction bits 30:31) (instruction bits 30:31) 

0 1 0 1 
0 1 0 

0 Id Idu 0 stfq 
AP AP 2 
DS DS DS 
2 
lwa 

1 AP 1 
DS 


Table 18. Extended opcodes for primary opcode 61 
(instruction bits 30:31) 


(instruction bits 30:31) 


Table 19. Extended opcodes for primary opcode 62 


0 1 0 1 
0 0 1 
0 stfqu 0 std stdu 
2 AP AP 
DS DS DS 
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Table 20 (Page 1 of 2). Extended opcodes for primary opcode 19 (instruction bits 21:30) 


10000 


10010 


10011 


10100 


10101 


10110 


10111 


11000 


11001 


11010 


11011 


11100 


11101 


11110 


00000 | 00001 | 00010 


E 00100 


ns É 


crandi isync 
00100 All All 
XL XL 
00101 
193 
crxor 
00110 A 
XL 
225 
crnanh 
00111 A 
XL 
257 
crand| 
01000 A 
XL 
289 
creqv 
01001 A 
XL 
01010 
01011 
01100 
417 
crore 
01101 All 
XL 
449 
cror 
01110 All 


XL 
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Table 20 (Page 2 of 2). Extended opcodes for primary opcode 19 (instruction bits 21:30) 


00000 | 00001 | 00010 | 00011 | 00100 | 00101 | 00110) 00111 | 01000 | 01001 | 01010) 01011 | 01100) 01101 | 01110) 01111 


10000 


10001 


10010 


10011 


10100 


10101 


10110 


10111 


11000 


11001 


11010 


11011 


11100 


11101 


11110 


11111 


10000 


528 
bectr 
All 
XL 


10001 


10010 


10011 


10100 


10101 


10110 


10111 


11000 


11001 


11010 


11011 


11100 


11101 


11110 


11111 
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Table 21 (Page 1 of 2). Extended opcodes for primary opcode 31 (instruction bits 21:30) 


00000 | 00001 | 00010 00100 01000] 01001] 01010 | 01011 11101/11110/11111 
4 29 30 
tw maskg rldicl| 
All 20 P 
X X 
(82) 83 84 
mtsrd, mfmst Idarx 
All AP 
x x 
00011 neg mul twi* mtsrdjn cif |Ibzux nor ridicrf 
All 20 All AP 20 A All AP 
XO XO D x x x x MD 
131 136 138 143 144 146 149 150 151 152 153 158 159 
00100 Res'd subfe adde $ mtcrf mtmst stdx | stwcx| stwx | slq sle ridic*| rlwim 
AP All All All All AP AP A 20 20 AP All 
XO XO XFX x x x x x x MD M 
163 
Res'd 
00101 AP 
202 
addze 
All 
XO 
232 233 234 235 239 242 246 247 248 254 255 
00111 subfmemulld| addmemull mulli* mtsrin dcbtsi stbux | slliq ridimi*rlwnm* 
All AP All All All All AP A 20 AP A 
XO XO XO XO D x x x x MD M 
262 264 266 271 274 277 278 279 284 286 287 
01000 Res'd doz add subfic* Res'd Iscbx | debt | Ihzx eqv rldcl*| ori* 
AP 20 All All A 20 AP A All AP A 
xo XO D x x x x MDS D 
303 306 308 310 | 31 316 318 | 319 
01001 o 
323 331 
Res'd div 
AP 20 
XO 
360 363 382 
01011 abs divs cmpi* tibia | mftb Iwaux Ihaux * xoris 
20 20 A AP AP AP A A 
XO XO D x XFX x x D 
399 402 407 412 414 415 
01100 addict slbmte sthx ore * andi. 
A A A All A 
D X X X D 
431 434 438 439 444 446 447 
01101 addic.* slbie ecowx sthux or * andis|* 
A AP AP A All A 
451 454 457 459 
Res'd Res'd divdu divwul 
AP AP AP AP 
XO 
486 488 | 489 491 495 498 501 502 | 503 510 
01111 Res'd nabs | divd divw addis] slbia i: cli | stmw? * 
AP 20 AP AP A AP 20 A 
XO XO XO D x x D 
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Table 21 (Page 2 of 2). Extended opcodes for primary opcode 31 (instruction bits 21:30) 


10110 


534 
Iwbrx 
All 
X 


00000 | 00001 | 00010 00100 
512 
All 
X 


01000 | 01001 | 01010] 01011] 01100 
ddc | mulhwu” 


566 


tlbsyncifs 


AP 


567 


616 619 627 630 63 
10011 neg’ mul mfsri dclst | Ifdux 
All 20 20 20 A 
XO XO x x x 
648 650 659 661 662 663 664 665 
10100 subfe adde” mfsrin stswx| stwbrx stfsx | srq | sre 
All All AP All All A 20 20 
XO XO x x x x x x 
695 696 
10101 stfsux sriq 
745 746 747 (758) | 759 760 
10111 lenulld| addmemull dcba | stfdux a 
AP All All AP A 2 
XO XO XO x x x 
778 790 79 792 794 
add’ Ihbrx | Ifqx | sraw srad 
11909 All All | 2 | AI AP 
XO x X X X 
818 823 824 826 827 
11001 rac Ifqux | srawi sradi | sradi 
20 2 All AP AP 
x X x XS XS 
> 851 
O slbmfev 
3 
= 872 875 
2. Goi abs’ divs’ 
>< 20 20 
129 XO XO 
!) 914 915 918 919 920 921 922 
11100 Res'd | slbmfee ASA sraq | srea | extsh 
O AP | A All 20 | 20 | All 
-5 X X x x x x 
O 946 951 952 954 
o 11101 Res'd stigur sraiq extsb 
o AP 20 AP 
O x X X 
= 966 969 971 978 982 983 986 
11110 Res'd divdu divwul Res'd icbi | stfiwx| extsw 
S5 AP AP AP AP AP | AP AP 
XO XO x x x 
o 998 1000 | 1001 1003 1010 1014 
11111 Res'd rod oa o Res'd conz 
AP AP 
XO XO XO X 
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Table 22 (Page 1 of 2). Extended opcodes for primary opcode 59 (instruction bits 21:30) 


00000 | 00001 | 00010 00100 


10000 


10001 


10010 


10100 


10101 


10110 


10111 


11000 


11001 


11010 


11100 


11101 


11110 


18 
fdivs 

AP 
A 


20 
fsubs 

A p 
A 


21 
fadds 

AP 
A 


22 
fsqrts| 
A p 


A 


24 
fres 
A p 
A 


25 
fmuls 

AP 
A 


28 


fmsubsfma 


AP 
A 


29 
AP 


A 


30 


A 


dds$nmsubisimad 
AP 


00011 


00100 


00101 


00111 


01000 


01001 


01011 


01100 


01101 


01111 
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Table 22 (Page 2 of 2). Extended opcodes for primary opcode 59 (instruction bits 21:30) 


00000|00001|00010 00100 


10000 


10001 


10010 


10011 


10100 


10101 


10110 


10111 


11000 


11001 


11010 


11011 


11100/11101/11110 


10011 


10100 


10101 


10111 


11000 


11001 


11011 


11100 


11101 


11110 


11111 


fdivs 


fsubs 


fadds 


fsqrts| 


fres 


fmuls 


fmsubsfmadds¢nmsulbisimadds 
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Table 23 (Page 1 of 2). E opcodes for primary opcode 63 (instruction bits 21:30) 


11010 


11011 


26 
frsqrte 
AP 
A 


00100 


134 136 
mtfsf fnabs 
All All 
x X 


00101 


00111 


01000 


01001 


01011 


01100 


01101 


01111 
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Table 23 (Page 2 of 2). Extended opcodes for primary opcode 63 (instruction bits 21:30) 
00000 | 00001 | 00010 00100 


10000 


10001 


10010 


10011 


10100 


10101 


10110 


10111 


11000 


11001 


11010 


11011 


11100 11101 11110 


10011 
10100 
10101 
10111 
11000 
814 | 815 
fctid | fctidz 
11001 AP AP 
X x 
> 846 
Ee) fcfid 
3 y 
X 
> 
2 11011 
>< 
O 11100 
© 
O 
S 11101 
D 
= 11110 
O 
o 
11111 
mk fdiv fsub | fadd | fsgrt | fsel fmul frsqrte fmsub fmadd fnmsutínma d 


L6 


6661 “bz “Gad - jenuepiuo?) WAI 


IBM Confidential - Feb. 24, 1999 


192 PowerPC AS User Instruction Set Architecture, Tags Inactive Subset 


IBM Confidential - Feb. 24, 1999 


Appendix J. PowerPC AS Instruction Set Sorted by Opcode 


This appendix lists all the instructions in the PowerPC 
AS Architecture, in order by opcode. A page number 
is shown for instructions that are defined in this Book 
(Book |, PowerPC AS User Instruction Set 
Architecture), and the Book number is shown for 


instructions that are defined in other Books (Book Il, 
PowerPC AS Virtual Environment Architecture, and 
Book Ill, PowerPC AS Operating Environment Archi- 
tecture). If an instruction is defined in more than one 
of these Books, the lowest-numbered Book is used. 


Opcode Mode | Page / |. onic 
Primary | Extend |Dep.'| Bk 

D 2 60 | tdi 
D 3 60 | twi 
D 7 54 | mulli 
D 8 SR 51 | subfic 
D 10 59 | cmpli 
D 11 58 | cmpi 
D 12 SR 50 | addic 
D 13 SR 50 | addic. 
D 14 49 | addi 
D 15 49 | addis 
B 16 CT 23 | be[l][a] 
SC 17 25 | sc 
| 18 23 | b[I][a] 
XL 19 0 28 | mcrf 
XL 19 16 CT 24 | belr[!] 
XL 19 18 Ill rfid 
XL 19 33 27 | crnor 
XL 19 129 27 | crande 
XL 19 150 Il} isync 
XL 19 193 26 | crxor 
XL 19 225 26 | crnand 
XL 19 257 26 | crand 
XL 19 289 27 | creqv 
XL 19 417 27 | crore 
XL 19 449 26 | cror 
XL 19 528 CT 24 | bectr[l] 
M 20 SR 73 | rlwimi[.] 
M 21 SR 70 | rlwinm[.] 
M 23 SR 72 | rlwnm[.] 
D 24 63 | ori 
D 25 63 | oris 
D 26 63 | xori 
D 27 63 | xoris 
D 28 SR 62 | andi. 
D 29 SR 62 | andis. 
MD 30 0 SR 69 | rldicl[.] 
MD 30 1 SR 69 | rldicr[.] 
MD 30 2 SR 70 | ridic[.] 
MD 30 3 SR 73 | rldimi[.] 
MDS 30 8 SR 71 | rldci[.] 
MDS 30 9 SR 72 | ridcr[.] 


Instruction 


Trap Doubleword Immediate 

Trap Word Immediate 

Multiply Low Immediate 

Subtract From Immediate Carrying 

Compare Logical Immediate 

Compare Immediate 

Add Immediate Carrying 

Add Immediate Carrying and Record 

Add Immediate 

Add Immediate Shifted 

Branch Conditional 

System Call 

Branch 

Move Condition Register Field 

Branch Conditional to Link Register 

Return from Interrupt Doubleword 

Condition Register NOR 

Condition Register AND with Complement 
Instruction Synchronize 

Condition Register XOR 

Condition Register NAND 

Condition Register AND 

Condition Register Equivalent 

Condition Register OR with Complement 
Condition Register OR 

Branch Conditional to Count Register 

Rotate Left Word Immediate then Mask Insert 
Rotate Left Word Immediate then AND with Mask 
Rotate Left Word then AND with Mask 

OR Immediate 

OR Immediate Shifted 

XOR Immediate 

XOR Immediate Shifted 

AND Immediate 

AND Immediate Shifted 

Rotate Left Doubleword Immediate then Clear Left 
Rotate Left Doubleword Immediate then Clear Right 
Rotate Left Doubleword Immediate then Clear 
Rotate Left Doubleword Immediate then Mask Insert 
Rotate Left Doubleword then Clear Left 
Rotate Left Doubleword then Clear Right 
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Opcode Mode | Page / . f 

orm 1 Mnemonic Instruction 

Primary | Extend | Dep.’ | Bk 
X 31 0 58 | cmp Compare 
X 31 4 61 |tw Trap Word 
XO 31 8 SR 51 | subfc[o][.] Subtract From Carrying 
XO 31 9 SR 55 | mulhdu[.] Multiply High Doubleword Unsigned 
XO 31 10 SR 51 | addc[o][.] Add Carrying 
XO 31 11 SR 55 | mulhwu[.] Multiply High Word Unsigned 
XFX 31 19 80 | mfcr Move From Condition Register 
XFX 31 19 120 | mfcr Move From Condition Register (optional version) 
X 31 20 Il} Iwarx Load Word And Reserve Indexed 
X 31 21 37 | Idx Load Doubleword Indexed 
X 31 23 35 | Iwzx Load Word and Zero Indexed 
X 31 24 SR 74 | slw[.] Shift Left Word 
X 31 26 SR 67 | cntlzw[.] Count Leading Zeros Word 
X 31 27 SR 74 | sld[.] Shift Left Doubleword 
X 31 28 SR 64 | and[.] AND 
X 31 32 59 | cmpl Compare Logical 
XO 31 40 SR 50 | subf[o][.] Subtract From 
X 31 53 37 | Idux Load Doubleword with Update Indexed 
X 31 54 Il | dcbst Data Cache Block Store 
X 31 55 35 | Iwzux Load Word and Zero with Update Indexed 
X 31 58 SR 67 | cntlzd[.] Count Leading Zeros Doubleword 
X 31 60 SR 65 | andc[.] AND with Complement 
X 31 68 61 | td Trap Doubleword 
XO 31 73 SR 55 | mulhd[.] Multiply High Doubleword 
XO 31 75 SR 55 | mulhw[.] Multiply High Word 
X 31 83 Ill | mfmsr Move From Machine State Register 
X 31 84 Il | Idarx Load Doubleword And Reserve Indexed 
X 31 86 Il} dcbf Data Cache Block Flush 
X 31 87 32 | Ibzx Load Byte and Zero Indexed 
XO 31 104 SR 53 | neg[o][.] Negate 
X 31 119 32 | Ibzux Load Byte and Zero with Update Indexed 
X 31 124 SR 65 | nor[.] NOR 
XO 31 136 SR 52 | subfe[o][.] Subtract From Extended 
XO 31 138 SR 52 | adde[o][.] Add Extended 
XFX 31 144 80 | mtcrf Move To Condition Register Fields 
XFX 31 144 120 | mtcrf Move To Condition Register Field (optional version) 
X 31 146 IIl | mtmsr Move To Machine State Register 
X 31 149 41 | stdx Store Doubleword Indexed 
X 31 150 Il | stwcx. Store Word Conditional Indexed 
X 31 151 40 | stwx Store Word Indexed 
X 31 178 Ill | mtmsrd Move To Machine State Register Doubleword 
X 31 181 41 | stdux Store Doubleword with Update Indexed 
X 31 183 40 | stwux Store Word with Update Indexed 
XO 31 200 SR 53 | subfze[o][.] | Subtract From Zero Extended 
XO 31 202 SR 53 | addze[o][.] | Add to Zero Extended 
X 31 210 32 IIl | mtsr Move To Segment Register 
X 31 214 Il} stdcx. Store Doubleword Conditional Indexed 
X 31 215 38 | stbx Store Byte Indexed 
XO 31 232 SR 52 | subfme[o][.] | Subtract From Minus One Extended 
XO 31 233 SR 54 | mulld[0][.] Multiply Low Doubleword 
XO 31 234 SR 52 | addme[o][.] | Add to Minus One Extended 
XO 31 235 SR 54 | mullw[0][.] Multiply Low Word 
X 31 242 32 III | mtsrin Move To Segment Register Indirect 
X 31 246 Il | dcbtst Data Cache Block Touch for Store 
X 31 247 38 | stbux Store Byte with Update Indexed 
XO 31 266 SR 50 | add[o][.] Add 
X 31 278 Il | dcbt Data Cache Block Touch 
X 31 279 33 | Ihzx Load Halfword and Zero Indexed 
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E ia Mode Page / Mnemonic 

Primary | Extend |Dep.'| Bk 
X 31 284 SR 65 | eqv[.] 
X 31 306 64 IIl | tlbie 
X 31 310 I| | eciwx 
X 31 311 33 | Ihzux 
X 31 316 SR 64 | xor[.] 
XFX 31 339 79 | mfspr 
X 31 341 36 | Iwax 
X 31 343 34 | lhax 
X 31 370 Ill | tibia 
XFX 31 371 Il} mftb 
X 31 373 36 | Iwaux 
X 31 375 34 | Ihaux 
X 31 402 IIl | slbmte 
X 31 407 39 | sthx 
X 31 412 SR 65 | orc[.] 
XS 31 413 SR 76 | sradi[.] 
X 31 434 IIl | slbie 
X 31 438 Il | ecowx 
X 31 439 39 | sthux 
X 31 444 SR 64 | or[.] 
XO 31 457 SR 57 | divdu[o][.] 
XO 31 459 SR 57 | divwu[o][.] 
XFX 31 467 78 | mtspr 
X 31 476 SR 64 | nand[.] 
XO 31 489 SR 56 | divd[o][.] 
XO 31 491 SR 56 | divw[o][.] 
X 31 498 IIl | slbia 
X 31 512 80 | merxr 
X 31 533 46 | Iswx 
X 31 534 42 | Iwbrx 
X 31 535 99 | Ifsx 
X 31 536 SR 75 | srw[.] 
X 31 539 SR 75 | srd[.] 
X 31 566 IIl tlosyne 
X 31 567 99 |Ifsux 
X 31 595 32 Ill | mfsr 
X 31 597 46 | Iswi 
X 31 598 Il} sync 
X 31 599 100 | Ifdx 
X 31 631 100 | Ifdux 
X 31 659 32 WI] mfsrin 
X 31 661 47 | stswx 
X 31 662 43 | stwbrx 
X 31 663 102 | stfsx 
X 31 695 102 | stfsux 
X 31 725 47 | stswi 
X 31 727 103 | stfdx 
X 31 759 103 | stfdux 
X 31 790 42 | Ihbrx 
X 31 792 SR 77 | sraw[.] 
X 31 794 SR 77 | srad[.] 
X 31 824 SR 76 | srawi[.] 
X 31 851 Ill} slbmfev 
X 31 854 Il | eieio 
X 31 915 Ill} slomfee 
X 31 918 43 | sthbrx 
X 31 922 SR 66 | extsh[.] 
X 31 954 SR 66 | extsb[.] 


Instruction 


Equivalent 

TLB Invalidate Entry 

External Control In Word Indexed 

Load Halfword and Zero with Update Indexed 
XOR 

Move From Special Purpose Register 

Load Word Algebraic Indexed 

Load Halfword Algebraic Indexed 

TLB Invalidate All 

Move From Time Base 

Load Word Algebraic with Update Indexed 
Load Halfword Algebraic with Update Indexed 
SLB Move To Entry 

Store Halfword Indexed 

OR with Complement 

Shift Right Algebraic Doubleword Immediate 
SLB Invalidate Entry 

External Control Out Word Indexed 

Store Halfword with Update Indexed 

OR 

Divide Doubleword Unsigned 

Divide Word Unsigned 

Move To Special Purpose Register 

NAND 

Divide Doubleword 

Divide Word 

SLB Invalidate All 

Move to Condition Register from XER 

Load String Word Indexed 

Load Word Byte-Reverse Indexed 

Load Floating-Point Single Indexed 

Shift Right Word 

Shift Right Doubleword 

TLB Synchronize 

Load Floating-Point Single with Update Indexed 
Move From Segment Register 

Load String Word Immediate 

Synchronize 

Load Floating-Point Double Indexed 

Load Floating-Point Double with Update Indexed 
Move From Segment Register Indirect 

Store String Word Indexed 

Store Word Byte-Reverse Indexed 

Store Floating-Point Single Indexed 

Store Floating-Point Single with Update Indexed 
Store String Word Immediate 

Store Floating-Point Double Indexed 

Store Floating-Point Double with Update Indexed 
Load Halfword Byte-Reverse Indexed 

Shift Right Algebraic Word 

Shift Right Algebraic Doubleword 

Shift Right Algebraic Word Immediate 

SLB Move From Entry VSID 

Enforce In-order Execution of I/O 

SLB Move From Entry ESID 

Store Halfword Byte-Reverse Indexed 

Extend Sign Halfword 

Extend Sign Byte 
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Opcode Mode 
rm 1 
Primary | Extend | Dep. 

X 31 982 
X 31 983 
X 31 986 SR 
X 31 1014 
D 32 

D 33 

D 34 

D 35 

D 36 

D 37 

D 38 

D 39 

D 40 

D 41 

D 42 

D 43 

D 44 

D 45 

D 46 

D 47 

D 48 

D 49 

D 50 

D 51 

D 52 

D 53 

D 54 

D 55 

DS 58 0 
DS 58 1 
DS 58 2 
A 59 18 
A 59 20 
A 59 21 
A 59 22 
A 59 24 
A 59 25 
A 59 28 
A 59 29 
A 59 30 
A 59 31 
DS 62 0 
DS 62 1 
X 63 0 
X 63 12 
X 63 14 
X 63 15 
A 63 18 
A 63 20 
A 63 21 
A 63 22 
A 63 23 
A 63 25 
A 63 26 
A 63 28 
A 63 29 
A 63 30 
A 63 31 


Mnemonic 


icbi 
stfiwx 
extsw[.] 
dcbz 
Iwz 
Iwzu 
lbz 
Ibzu 
stw 
stwu 
stb 
stbu 
Ihz 
lhzu 
lha 
lhau 
sth 
sthu 
Imw 
stmw 
lfs 
lfsu 
Ifd 


fadds[.] 
fsqrts[.] 
fres[.] 
fmuls[.] 
fmsubs[.] 
fmadds[.] 
fnmsubs[.] 
fnmadds[.] 
std 

stdu 
fcmpu 
frsp[.] 
fctiw[.] 
fctiwz[.] 
fdiv[.] 
fsub[.] 
fadd[.] 
fsqrt[.] 
fsel[.] 
fmul[.] 
frsqrte[.] 
fmsub[.] 
fmadd[.] 
fnmsub[.] 
fnmadd[.] 


Instruction 


Instruction Cache Block Invalidate 
Store Floating-Point as Integer Word Indexed 
Extend Sign Word 

Data Cache Block set to Zero 

Load Word and Zero 

Load Word and Zero with Update 

Load Byte and Zero 

Load Byte and Zero with Update 

Store Word 

Store Word with Update 

Store Byte 

Store Byte with Update 

Load Halfword and Zero 

Load Halfword and Zero with Update 
Load Halfword Algebraic 

Load Halfword Algebraic with Update 
Store Halfword 

Store Halfword with Update 

Load Multiple Word 

Store Multiple Word 

Load Floating-Point Single 

Load Floating-Point Single with Update 
Load Floating-Point Double 

Load Floating-Point Double with Update 
Store Floating-Point Single 

Store Floating-Point Single with Update 
Store Floating-Point Double 

Store Floating-Point Double with Update 
Load Doubleword 

Load Doubleword with Update 

Load Word Algebraic 

Floating Divide Single 

Floating Subtract Single 

Floating Add Single 

Floating Square Root Single 

Floating Reciprocal Estimate Single 
Floating Multiply Single 

Floating Multiply-Subtract Single 
Floating Multiply-Add Single 

Floating Negative Multiply-Subtract Single 
Floating Negative Multiply-Add Single 
Store Doubleword 

Store Doubleword with Update 

Floating Compare Unordered 

Floating Round to Single-Precision 
Floating Convert To Integer Word 
Floating Convert To Integer Word with round toward Zero 
Floating Divide 

Floating Subtract 

Floating Add 

Floating Square Root 

Floating Select 

Floating Multiply 

Floating Reciprocal Square Root Estimate 
Floating Multiply-Subtract 

Floating Multiply-Add 

Floating Negative Multiply-Subtract 
Floating Negative Multiply-Add 
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Form oprane Moar Page: Mnemonic 
Primary | Extend | Dep.’ | Bk 

X 63 32 114 | fempo 
X 63 38 117 | mtfsb1[.] 
X 63 40 105 | fneg[.] 
X 63 64 115 | mcrfs 

X 63 70 117 | mtfsbO[.] 
X 63 72 105 | fmr[.] 

X 63 134 116 | mtfsfi[.] 
X 63 136 105 | fnabs[.] 
X 63 264 105 | fabs[.] 
X 63 583 115 | mffs[.] 
XFL 63 711 116 | mtfsf[.] 
X 63 814 111 | fetid[.] 
X 63 815 111 | fctidz[.] 
X 63 846 113 | fcfid[.] 


Instruction 


Floating Compare Ordered 

Move To FPSCR Bit 1 

Floating Negate 

Move to Condition Register from FPSCR 
Move To FPSCR Bit O 

Floating Move Register 

Move To FPSCR Field Immediate 

Floating Negative Absolute Value 

Floating Absolute Value 

Move From FPSCR 

Move To FPSCR Fields 

Floating Convert To Integer Doubleword 
Floating Convert To Integer Doubleword with round 
toward Zero 

Floating Convert From Integer Doubleword 


1See key to mode dependency column, on page 203. 
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Appendix K. PowerPC AS Instruction Set Sorted by Mnemonic 


This appendix lists all the instructions in the PowerPC 
A page 
number is shown for instructions that are defined in 
this Book (Book |, PowerPC AS User Instruction Set 
Architecture), and the Book number is shown for 


AS Architecture, in order by mnemonic. 


instructions that are defined in other Books (Book Il, 
PowerPC AS Virtual Environment Architecture, and 
Book Ill, PowerPC AS Operating Environment Archi- 


tecture). 


If an instruction is defined in more than one 


of these Books, the lowest-numbered Book is used. 


Opcode Mode | Page / |. e monic 
Primary | Extend Dep.'| Bk 

XO 31 266 SR 50 | add[o][.] 
XO 31 10 SR 51 | addc[o][.] 
XO 31 138 SR 52 | adde[o][.] 
D 14 49 | addi 
D 12 SR 50 | addic 
D 13 SR 50 | addic. 
D 15 49 | addis 
XO 31 234 SR 52 | addme[o][.] 
XO 31 202 SR 53 | addze[o][.] 
X 31 28 SR 64 | and[.] 
X 31 60 SR 65 | andc[.] 
D 28 SR 62 | andi. 
D 29 SR 62 | andis. 
| 18 23 | b[I][a] 
B 16 CT 23 | be[l][a] 
XL 19 528 CT 24 | bectr[l] 
XL 19 16 CT 24 | belr[!] 
X 31 0 58 | cmp 
D 11 58 | cmpi 
X 31 32 59 | cmpl 
D 10 59 | cmpli 
X 31 58 SR 67 | cntlzd[.] 
X 31 26 SR 67 | cntlzw[.] 
XL 19 257 26 | crand 
XL 19 129 27 | crandc 
XL 19 289 27 | creqv 
XL 19 225 26 | crnand 
XL 19 33 27 | crnor 
XL 19 449 26 | cror 
XL 19 417 27 | crorc 
XL 19 193 26 | crxor 
X 31 86 Il} dcbf 
X 31 54 I| dcbst 
X 31 278 Il | debt 
X 31 246 Il | dcbtst 
X 31 1014 Il | dcbz 
XO 31 489 SR 56 | divd[o][.] 
XO 31 457 SR 57 | divdu[o][.] 
XO 31 491 SR 56 | divw[o][.] 
XO 31 459 SR 57 | divwu[o][.] 
X 31 310 I| | eciwx 


Instruction 


Add 

Add Carrying 

Add Extended 

Add Immediate 

Add Immediate Carrying 

Add Immediate Carrying and Record 
Add Immediate Shifted 

Add to Minus One Extended 

Add to Zero Extended 

AND 

AND with Complement 

AND Immediate 

AND Immediate Shifted 

Branch 

Branch Conditional 

Branch Conditional to Count Register 
Branch Conditional to Link Register 
Compare 

Compare Immediate 

Compare Logical 

Compare Logical Immediate 

Count Leading Zeros Doubleword 
Count Leading Zeros Word 
Condition Register AND 

Condition Register AND with Complement 
Condition Register Equivalent 
Condition Register NAND 

Condition Register NOR 

Condition Register OR 

Condition Register OR with Complement 
Condition Register XOR 

Data Cache Block Flush 

Data Cache Block Store 

Data Cache Block Touch 

Data Cache Block Touch for Store 
Data Cache Block set to Zero 
Divide Doubleword 

Divide Doubleword Unsigned 
Divide Word 

Divide Word Unsigned 

External Control In Word Indexed 
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Opcode 
rm 
Primary | Extend 

X 31 438 
X 31 854 
X 31 284 
X 31 954 
X 31 922 
X 31 986 
X 63 264 
A 63 21 
A 59 21 
X 63 846 
X 63 32 
X 63 0 
X 63 814 
X 63 815 
X 63 14 
X 63 15 
A 63 18 
A 59 18 
A 63 29 
A 59 29 
X 63 72 
A 63 28 
A 59 28 
A 63 25 
A 59 25 
X 63 136 
X 63 40 
A 63 31 
A 59 31 
A 63 30 
A 59 30 
A 59 24 
X 63 12 
A 63 26 
A 63 23 
A 63 22 
A 59 22 
A 63 20 
A 59 20 
X 31 982 
XL 19 150 
D 34 

D 35 

X 31 119 
X 31 87 
DS 58 0 
X 31 84 
DS 58 1 
X 31 53 
X 31 21 
D 50 

D 51 

X 31 631 
X 31 599 
D 48 

D 49 

X 31 567 


Mnemonic 


ecowx 
eieio 
eqv[.] 
extsb[.] 
extsh[.] 
extsw[.] 
fabs[.] 
fadd[.] 
fadds[.] 
fcfid[.] 
fcmpo 
fcmpu 
fctid[.] 
fctidz[.] 


fctiw[.] 
fctiwz[.] 
fdiv[.] 
fdivs[.] 
fmadd[.] 
fmadds[.] 
fmr[.] 
fmsub[.] 
fmsubs[.] 
fmul[.] 
fmuls[.] 
fnabs[.] 
fneg[.] 
fnmadd[.] 
fnmadds[.] 
fnmsub[.] 
fnmsubs[.] 
fres[.] 
frsp[.] 
frsqrte[.] 
fsel[.] 
fsqrit[.] 
fsqrts[.] 
fsub[.] 
fsubs[.] 
icbi 


Instruction 


External Control Out Word Indexed 
Enforce In-order Execution of I/O 
Equivalent 

Extend Sign Byte 

Extend Sign Halfword 

Extend Sign Word 

Floating Absolute Value 

Floating Add 

Floating Add Single 

Floating Convert From Integer Doubleword 
Floating Compare Ordered 

Floating Compare Unordered 

Floating Convert To Integer Doubleword 
Floating Convert To Integer Doubleword with round 
toward Zero 

Floating Convert To Integer Word 

Floating Convert To Integer Word with round toward Zero 
Floating Divide 

Floating Divide Single 

Floating Multiply-Add 

Floating Multiply-Add Single 

Floating Move Register 

Floating Multiply-Subtract 

Floating Multiply-Subtract Single 

Floating Multiply 

Floating Multiply Single 

Floating Negative Absolute Value 

Floating Negate 

Floating Negative Multiply-Add 

Floating Negative Multiply-Add Single 
Floating Negative Multiply-Subtract 
Floating Negative Multiply-Subtract Single 
Floating Reciprocal Estimate Single 
Floating Round to Single-Precision 
Floating Reciprocal Square Root Estimate 
Floating Select 

Floating Square Root 

Floating Square Root Single 

Floating Subtract 

Floating Subtract Single 

Instruction Cache Block Invalidate 
Instruction Synchronize 

Load Byte and Zero 

Load Byte and Zero with Update 

Load Byte and Zero with Update Indexed 
Load Byte and Zero Indexed 

Load Doubleword 

Load Doubleword And Reserve Indexed 
Load Doubleword with Update 

Load Doubleword with Update Indexed 
Load Doubleword Indexed 

Load Floating-Point Double 

Load Floating-Point Double with Update 
Load Floating-Point Double with Update Indexed 
Load Floating-Point Double Indexed 

Load Floating-Point Single 

Load Floating-Point Single with Update 
Load Floating-Point Single with Update Indexed 
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Opcode Mode | Page / : . 
1 Mnemonic Instruction 
Primary | Extend | Dep.’ | Bk 
X 31 535 99 | Ifsx Load Floating-Point Single Indexed 
D 42 34 | Iha Load Halfword Algebraic 
D 43 34 | lhau Load Halfword Algebraic with Update 
X 31 375 34 | Ihaux Load Halfword Algebraic with Update Indexed 
X 31 343 34 | lhax Load Halfword Algebraic Indexed 
X 31 790 42 | Ihbrx Load Halfword Byte-Reverse Indexed 
D 40 33 | Ihz Load Halfword and Zero 
D 41 33 | Ihzu Load Halfword and Zero with Update 
X 31 311 33 | Ihzux Load Halfword and Zero with Update Indexed 
X 31 279 33 | Ihzx Load Halfword and Zero Indexed 
D 46 44 | Imw Load Multiple Word 
X 31 597 46 | Iswi Load String Word Immediate 
X 31 533 46 | Iswx Load String Word Indexed 
DS 58 2 36 |lwa Load Word Algebraic 
X 31 20 Il | Iwarx Load Word And Reserve Indexed 
X 31 373 36 | Iwaux Load Word Algebraic with Update Indexed 
X 31 341 36 | lwax Load Word Algebraic Indexed 
X 31 534 42 | Iwbrx Load Word Byte-Reverse Indexed 
D 32 35 | Iwz Load Word and Zero 
D 33 35 | Iwzu Load Word and Zero with Update 
X 31 55 35 | Iwzux Load Word and Zero with Update Indexed 
X 31 23 35 | Iwzx Load Word and Zero Indexed 
XL 19 0 28 | mcrf Move Condition Register Field 
X 63 64 115 | mcrfs Move to Condition Register from FPSCR 
X 31 512 80 | merxr Move to Condition Register from XER 
XFX 31 19 80 | mfcr Move From Condition Register 
XFX 31 19 120 | mfcr Move From Condition Register (optional version) 
X 63 583 115 | mffs[.] Move From FPSCR 
X 31 83 Ill | mfmsr Move From Machine State Register 
XFX 31 339 79 | mfspr Move From Special Purpose Register 
X 31 595 32 IIl | mfsr Move From Segment Register 
X 31 659 32 III | mfsrin Move From Segment Register Indirect 
XFX 31 371 Il} mftb Move From Time Base 
XFX 31 144 80 | mtcrf Move To Condition Register Fields 
XFX 31 144 120 | mtcrf Move To Condition Register Field (optional version) 
X 63 70 117 | mtfsbO[.] Move To FPSCR Bit 0 
X 63 38 117 | mtfsb1[.] Move To FPSCR Bit 1 
XFL 63 711 116 | mtfsf[.] Move To FPSCR Fields 
X 63 134 116 | mtfsfi[.] Move To FPSCR Field Immediate 
X 31 146 Il} mtmsr Move To Machine State Register 
X 31 178 Ill} mtmsrd Move To Machine State Register Doubleword 
XFX 31 467 78 | mtspr Move To Special Purpose Register 
X 31 210 32 IIl | mtsr Move To Segment Register 
X 31 242 32 III | mtsrin Move To Segment Register Indirect 
XO 31 73 SR 55 | mulhd[.] Multiply High Doubleword 
XO 31 9 SR 55 | mulhdu[.] Multiply High Doubleword Unsigned 
XO 31 75 SR 55 | mulhw[.] Multiply High Word 
XO 31 11 SR 55 | mulhwu[.] Multiply High Word Unsigned 
XO 31 233 SR 54 | mulld[0][.] Multiply Low Doubleword 
D 7 54 | mulli Multiply Low Immediate 
XO 31 235 SR 54 | mullw[0][.] Multiply Low Word 
X 31 476 SR 64 | nand[.] NAND 
XO 31 104 SR 53 | neg[o][.] Negate 
X 31 124 SR 65 | nor[.] NOR 
X 31 444 SR 64 | or[.] OR 
X 31 412 SR 65 | orc[.] OR with Complement 
D 24 63 | ori OR Immediate 
D 25 63 | oris OR Immediate Shifted 
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Opcode Mode | Page / . 
di Primary| Extend |DeP- | Bk ONE 
XL 19 18 Ill rfid 
MDS 30 8 SR 71 | rldci[.] 
MDS 30 9 SR 72 | rldcr[.] 
MD 30 2 SR 70 | rldic[.] 
MD 30 0 SR 69 | rldicl[.] 
MD 30 1 SR 69 | rldicr[.] 
MD 30 3 SR 73 | rldimi[.] 
M 20 SR 73 | rlwimi[.] 
M 21 SR 70 | rlwinm[.] 
M 23 SR 72 | rlwnm[.] 
SC 17 25 | sc 
X 31 498 IIl | slbia 
X 31 434 Ill | slbie 
X 31 915 Ill} slomfee 
X 31 851 Ill} slomfev 
X 31 402 IIl | slbmte 
X 31 27 SR 74 | sld[.] 
X 31 24 SR 74 | slw[.] 
X 31 794 SR 77 | srad[.] 
XS 31 413 SR 76 | sradi[.] 
X 31 792 SR 77 | sraw[.] 
X 31 824 SR 76 | srawi[.] 
X 31 539 SR 75 | srd[.] 
X 31 536 SR 75 | srw[.] 
D 38 38 | stb 
D 39 38 | stbu 
X 31 247 38 | stbux 
X 31 215 38 | stbx 
DS 62 0 41 | std 
X 31 214 Il} stdcx. 
DS 62 1 41 | stdu 
X 31 181 41 | stdux 
X 31 149 41 | stdx 
D 54 103 | stfd 
D 55 103 | stfdu 
X 31 759 103 | stfdux 
X 31 727 103 | stfdx 
X 31 983 104 | stfiwx 
D 52 102 | stfs 
D 53 102 | stfsu 
X 31 695 102 | stfsux 
X 31 663 102 | stfsx 
D 44 39 | sth 
X 31 918 43 | sthbrx 
D 45 39 | sthu 
X 31 439 39 | sthux 
X 31 407 39 | sthx 
D 47 44 | stmw 
X 31 725 47 | stswi 
X 31 661 47 | stswx 
D 36 40 | stw 
X 31 662 43 | stwbrx 
X 31 150 ||| stwcx. 
D 37 40 | stwu 
X 31 183 40 | stwux 
X 31 151 40 | stwx 
XO 31 40 SR 50 | subf[o][.] 
XO 31 8 SR 51 | subfc[o][.] 


Instruction 


Return from Interrupt Doubleword 

Rotate Left Doubleword then Clear Left 

Rotate Left Doubleword then Clear Right 
Rotate Left Doubleword Immediate then Clear 
Rotate Left Doubleword Immediate then Clear Left 
Rotate Left Doubleword Immediate then Clear Right 
Rotate Left Doubleword Immediate then Mask Insert 
Rotate Left Word Immediate then Mask Insert 
Rotate Left Word Immediate then AND with Mask 
Rotate Left Word then AND with Mask 

System Call 

SLB Invalidate All 

SLB Invalidate Entry 

SLB Move From Entry ESID 

SLB Move From Entry VSID 

SLB Move To Entry 

Shift Left Doubleword 

Shift Left Word 

Shift Right Algebraic Doubleword 

Shift Right Algebraic Doubleword Immediate 
Shift Right Algebraic Word 

Shift Right Algebraic Word Immediate 

Shift Right Doubleword 

Shift Right Word 

Store Byte 

Store Byte with Update 

Store Byte with Update Indexed 

Store Byte Indexed 

Store Doubleword 

Store Doubleword Conditional Indexed 

Store Doubleword with Update 

Store Doubleword with Update Indexed 

Store Doubleword Indexed 

Store Floating-Point Double 

Store Floating-Point Double with Update 

Store Floating-Point Double with Update Indexed 
Store Floating-Point Double Indexed 

Store Floating-Point as Integer Word Indexed 
Store Floating-Point Single 

Store Floating-Point Single with Update 

Store Floating-Point Single with Update Indexed 
Store Floating-Point Single Indexed 

Store Halfword 

Store Halfword Byte-Reverse Indexed 

Store Halfword with Update 

Store Halfword with Update Indexed 

Store Halfword Indexed 

Store Multiple Word 

Store String Word Immediate 

Store String Word Indexed 

Store Word 

Store Word Byte-Reverse Indexed 

Store Word Conditional Indexed 

Store Word with Update 

Store Word with Update Indexed 

Store Word Indexed 

Subtract From 

Subtract From Carrying 
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Opcode Mode | Page / F 
Form 1 Mnemonic 
Primary | Extend |Dep.'| Bk 
XO 31 136 SR 52 | subfe[o][.] 
D 8 SR 51 | subfic 
XO 31 232 SR 52 | subfme[o][.] 
XO 31 200 SR 53 | subfze[o][.] 
X 31 598 Il | sync 
X 31 68 61 | td 
D 2 60 | tdi 
X 31 370 Ill | tlbia 
X 31 306 64 IIl | tlbie 
X 31 566 111 | tlbsync 
X 31 4 61 | tw 
D 3 60 | twi 
X 31 316 SR 64 | xor[.] 
D 26 63 | xori 
D 27 63 | xoris 


Instruction 


Subtract From Extended 

Subtract From Immediate Carrying 
Subtract From Minus One Extended 
Subtract From Zero Extended 
Synchronize 

Trap Doubleword 

Trap Doubleword Immediate 

TLB Invalidate All 

TLB Invalidate Entry 

TLB Synchronize 

Trap Word 

Trap Word Immediate 

XOR 

XOR Immediate 

XOR Immediate Shifted 


TKey to Mode Dependency Column 


Except as described below and in Section 1.12.2, 
“Effective Address Calculation” on page 15, all 
instructions are independent of whether the processor 
is in 32-bit or 64-bit mode. 


CT If the instruction tests the Count Register, it 
tests the low-order 32 bits in 32-bit mode and 
all 64 bits in 64-bit mode. 

SR The setting of status registers (such as XER 


and CRO) is mode-dependent. 


32 The instruction must be executed only in 
32-bit mode. 

64 The instruction must be executed only in 
64-bit mode. 
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Index 


[^] 


abit 20 

A-form 9 

AA field 9 

address 14 
effective 15 

assembler language 
extended mnemonics 
mnemonics 143 
symbols 143 


B-form 7 
BA field 
BB field 
BD field 
BF field 
BFA field 9 

BH field 9 

Blfield 9 

Big-Endian 124 

BO field 9,20 
boundedly undefined 3 
BT field 9 

byte ordering 124 
bytes 2 


XO «o co «o 


D field 10 

D-form 7 

defined instructions 11 
denormalization 87 


denormalized number 86 
double-precision 88 
doublewords 2 

DS field 10 

DS-form 7 


E] 


EA 15 
effective address 15 
EQ 18,19 


FF| 


facilities 
optional 13 
FE 19, 84 
FEX 83 
FG 19, 84 
FI 84 
FL 18, 84 
FLM field 10 
floating-point 
denormalization 87 
double-precision 88 
exceptions 82, 89 
inexact 94 
invalid operation 91 
overflow 93 
underflow 93 
zero divide 92 
execution models 94 
normalization 87 


number 
denormalized 86 
infinity 86 


normalized 86 
nota number 86 
zero 86 
rounding 88 
sign 87 
single-precision 88 
FPCC 84 
FPR 82 
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FPRF 84 
FPSCR 83 i 
C 8 
FE 84 form 6 
FEX 83 illegal instructions 11 
FG 84 inexact 94 
FI 84 infinity 86 
FL 84 instruction 
FPCC 84 fields 9, 10, 11 
FPRF 84 AA 9 
FR 84 BA 9 
FU 84 BB 9 
FX 83 BD 9 
OE 84 BF 9 
OX 83 BFA 9 
RN 85 BH 9 
UE 84 BI 9 
UX 83 BO 9 
VE 84 BT 9 
VX 83 D 10 
VXCVI 84 DS 10 
VXIDI 83 FLM 10 
VXIMZ 84 FRA 10 
VXISI 83 FRB 10 
VXSNAN 83 FRC 10 
VXSOFT 84 FRS 10 
VXSQRT 84 FRT 10 
VXVC 84 FXM 10 
VXZDZ 84 L 10 
XE 84 LEV 10 
XX 83 LI 10 
ZE 84 LK 10 
ZX 83 MB 10 
FR 84 ME 10 
FRA field 10 NB 10 
FRB field 10 OE 10 
FRC field 10 RA 10 
FRS field 10 RB 10 
FRT field 10 Rc 10 
FU 19,84 RS 10 
FX 83 RT 10 
FXM field 10 SH 10 
SI 10 
SPR 10 
[e | SR 10 
TBR 11 
GPR 29 TH 11 
GT 18,19 TO 11 
Gulliver's Travels 124 U 11 
Ul 11 
XO 11 
[H] formats 6, 7, 8, 9 
halfwords 2 pn : 
"M -form 7 
hardware description language 4 D-form 7 
DS-form 7 
l-form 6 
M-form 9 
MD-form 9 
MDS-form 9 
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instruction (continued) 
formats (continued) 
SC-form 7 
X-form 8 
XFL-form 8 
XFX-form 8 
XL-form 8 
XO-form 8 
XS-form 8 
instructions 
classes 11 
defined 11 
forms 12 
illegal 11 
invalid forms 12 
optional 13 
preferred forms 12 
reserved 11 
invalid instruction forms 12 
invalid operation 91 


L field 10 

language used for instruction operation description 4 
LEV field 10 

Ll field 10 

Little-Endian 124 

LK field 10 

LR 19 

LT 18 


[m] 


M-form 9 

MB field 10 

MD-form 9 

MDS-form 9 

ME field 10 

mnemonics 
extended 143 


IN. 


NB field 10 

NIA 4 

no-op 63 
normalization 87 
normalized number 86 
not a number 86 


[o| 


OE 84 


OE field 10 

optional facility 13 
optional instruction 13 
OV 30 

overflow 93 

OX 83 


al 


preferred instruction forms 12 


a 


quadwords 2 


[n 


RA field 10 

RB field 10 

Rc field 10 

register transfer level language 4 

registers 
Condition Register 18 
Count Register 19 
Fixed-Point Exception Register 30 
Floating-Point Registers 82 
Floating-Point Status and Control Register 83 
General Purpose Registers 29 
Link Register 19 

reserved field 3 

reserved instructions 11 


RN 85 

rounding 88 

RS field 10 

RT field 10 

RTL 4 

SC-form 7 
sequential execution model 17 
SH field 10 

SI field 10 

sign 87 
single-precision 88 
SO 18, 19, 30 

split field notation 6 
SPR field 10 

SR field 10 


storage access 
floating-point 98 
storage address 14 
Swift, Jonathan 124 
symbols 143 
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zero divide 92 
ZX 83 


t bit 20 
TBR field 11 
TH field 11 
TO field 11 


O 


Ufield 11 

UE 84 

Ul field 11 

undefined 
boundedly 3 

underflow 93 

UX 83 


VXISI 83 
VXSNAN 83 
VXSOFT 84 
VXSQRT 84 
VXVC 84 
VXZDZ 84 


X-form 8 
XE 84 
XER 30 
XFL-form 8 
XFX-form 8 
XL-form 8 
XO field 11 
XO-form 
XS-form 8 
XX 83 


[z 


zbit 20 
ZE 84 
zero 86 
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